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Work  done  under  contract  F49620-77-C-0005  on  defects  in  tetrahedral  semiconduc 
tors  may  be  divided  in  two  categories: 


1.  Studies  of  the  thermochemistry  of  point  and  line  defects:  Work  carried  out  in  this 
area  is  described  in  two  journal  articles:  "Formation  of  Interstitial-type  Dislocation  Loops  in 
Tetrahedral  Semiconductors  by  Precipitation  of  Vacancies",  by  J.  A.  Van  Vechten,  Phys.  Rev. 
B 17,  3197  (1978)  (also  given  as  an  invited  talk  at  the  APS  March  meeting  in  Chicago,  March 
21,  1979);  and  "Threshold  for  Optically  Induced  Dislocation  Glide  in  GaAs-AlGaAs  Double 
Heterostructures:  Degradation  via  a New  Cooperative  Phenomenon?",  by  B.  Monemar,  R.  M. 
Potemski,  M.  B.  Small,  J.  A.  Van  Vechten  and  G.  R.  Woolhouse,  Phys.  Rev.  Lett.  41,  260 
(1978).  These  two  articles  are  included  as  Appendices  A and  B in  this  report.  More  of  this 
work  will  appear  in  a forthcoming  chapter  by  J.  A.  Van  Vechten  in  Vol.  Ill  of  the  Handbook 
of  Semiconductors,  edited  by  S.  P.  Keller  (North  Holland,  Amsterdam,  1979  or  1980),  in  an 
invited  talk  entitled  "GaAs  Is  Different  from  Si",  to  be  given  at  the  AIME  Electronic  Materi- 
als Conference,  in  Boulder,  Co.,  June  24-29,  1979  (Appendix  C),  and  in  publications  still  in 
preparation  which  deal  with:  (a)  variation  in  the  rate  of  anion  and  cation  atomic  diffusion 
with  stoichiometry  of  host  III-V  crystals;  (b)  site  preference  and  compensation  ratio  of  Group 
IV  impurities  (C,  Si,  Ge  and  Sn)  in  III-V  crystals;  (c)  defect  migration  in  Si  during  pulsed 
laser  annealing  and  nonthermal  mechanisms  by  which  this  annealing  can  occur.  Subjects  (a) 
and  (b)  above  are  treated  in  the  aforementioned  handbook  chapter  and  will  be  treated  in  the 
EMC  invited  talk  while  subject  (c)  was  treated  in  two  contributed  talks  at  the  APS  March 
meeting,  March  30,  1979:  "Mechanism  for  the  Production  of  a Metastable  Plasma  in  Pulsed 
Laser  Annealing"  by  Ellen  J.  Yoffa  and  J.  A.  Van  Vechten;  and  "Defect  Migration  During 
Pulsed  Laser-induced  Plasma  Annealing"  by  J.  A.  Van  Vechten  and  Ellen  J.  Yoffa.  Summar- 
ies of  these  two  talks  are  included  in  this  report  as  Appendix  D. 

2.  Electronic-structure  studies  of  point  defects:  During  the  term  of  this  contract  we  reported 
the  first  self-consistent  calculation  of  the  electronic  structure  of  a neutral  vacancy  in  Si 


assuming  no  lattice  distortions.  The  main  results  are  contained  in  a paper  by  Bernholc,  Lipari 


and  Pantelides  (Appendix  E).  In  other  work  on  deep  impurities,  T.  N.  Morgan  has  investigat- 


ed the  capture  and  excitation  processes  at  the  O center  in  GaP  and  has  reinterpreted  existing 


data  (Appendix  F).  Work  has  also  been  carried  out  on  shallow  impurities.  S.  T.  Pantelides 


has  derived  and  discussed  new  effective-mass  equations  which  are  appropriate  for  materials 


with  multivalley  bands  (Appendix  G).  N.  O.  Lipari,  in  collaboration  with  A.  Baldereschi 


(Lausanne,  Switzerland),  carried  out  new,  accurate  calculations  of  acceptor  spectra  in  Si  and 


Ge  (Appendix  H).  N.  O.  Lipari,  in  collaboration  with  D.  L.  Dexter  (Rochester,  N.Y.),  carried 


out  a study  of  polarizabilities  of  impurities  in  the  presence  of  a magnetic  field  (Appendix  I) 


Finally,  and  J.  Pollman  and  N.  O.  Lipari,  in  collaboration  with  H.  BUttner  (Bayreuth,  German- 


y), studied  the  quenching  of  exciton  diamagnetic  shifts  in  polar,  layered  materials  (Appendix 


Formation  of  interstitial- type  dislocation  loops  in  tetrahedral  semiconductors 

by  precipitation  of  vacancies 


J.  A.  Van  Vechten 

IBM  Thomas  J.  Watson  Research  Center,  Yorktown  Heights.  New  York  10S98 
(Received  14  November  1977) 

It  it  hypothesized  that  when  vacancies  precipitate  to  form  voids  in  tetrahedral  semiconductors,  e.g  , Si  and 
GaAs,  reconstruction  reactions  occur  on  the  internal  surfaces  of  these  voids  in  the  same  manner  as  they  are 
observed  to  occur  on  external  surfaces  of  the  same  crystallographic  orientation.  Previously,  it  has  been 
concluded  that  many  of  the  various  reconstruction  reactions  observed  on  various  semiconductor  surfaces 
produce  hillocks  by  expelling  a portion  of  the  atoms  from  the  unreconstructed  (ideal)  surface  to  migrate  in  a 
reaction  front  across  the  surface.  Prom  these  two  lemmas,  it  is  here  concluded  that  the  corresponding  waves 
of  atoms  driven  by  reconstruction  on  internal  surfaces  will  precipitate  into  pillars  of  crystal-line  material 
within  the  void  and  produce  dislocation  loops  at  which  the  lattice  planes  bow  away  from  the  center.  Such 
dislocations  are  conventionally  denoted  “interstitial  type.”  This  mechanism  may  explain  several  observations 
of  “interstitial  type”  dislocations  in  semiconductors,  including  Si  and  GaAs,  for  which  there  is  much 
evidence  for  vacancies  and  no  other  evidence  for  self-interstitials. 


I.  INTRODUCTION:  VACANCIES  OR  INTERSTITIALS? 


interstitials, 19  which  provides  a large  driving 
force  for  their  removal,  (ii)  Several  impurities, 
such  as  Zn  in  GaAs,10  are  observed  to  diffuse 
as  interstitials  but  to  occupy  predominantly  sub- 
stitutional sites.  Therefore,  there  must  be  vacant 
lattice  sites  for  them  to  occupy,  (iii)  Diffusion 
and  related  phenomena  are  successfully  described 
by  ascribing  multiple  ionization  levels  to  the  native 
defect  which  are  the  same  as  those  observed  for 
the  vacancy  in  low- temperature  irradiation  exper- 
iments.4,21,22  (iv)  Theoretical  estimates  of  the 
heat  of  formation  of  vacancies  correspond  well 
with  those  of  the  native  defect  observed  in  quench- 
ing and  diffusion  experiments.21*15  As  the  thresh- 
old energies  for  Frenkel  pairs  in  irradiation  ex- 
periments are  quite  high,  the  energy  to  be  as- 
cribed to  the  heat  of  formation  of  the  interstitial 
is  much  higher  than  that  of  the  vacancy.1*  This 
is  also  in  agreement  with  simple  theoretical  ar- 
guments.1* 

Those  who  believe  that  self- interstitials  are 
the  dominant  native  defects  in  these  same  group 
IV  and  m-V  semiconductors  do  so  mainly  because 
the  great  majority  of  dislocation  loops  and  stack- 
ing faults  observed  by  transmission  electron  mic- 
roscopy (TEM)  in  these  materials  have  been  de- 
termined to  be  of  extrinsic  or  “interstitial”  char- 
acter.T*,1*,,'l*»,,«1T  Although  four  mechanisms  by 
which  interstitial- type  dislocations  may  be  formed 
without  any  self- interstitials  being  present  are 
well  established1  (and  discussed  in  Sec.  Q),  these 
authors  feel  that  in  several  cases  these  four  al- 
ternative mechanisms  do  not  obtain  and  that  the 
only  remaining  mechanism  is  the  precipitation 
of  self-interstitials. 

The  purpose  of  the  present  paper  is  to  analyze 


Surely  the  most  fundamental  question  to  resolve 
when  one  begins  the  study  of  deep  point  defects 
and  line  defects  in  semiconductors  is  whether  the 
dominant  native  defect  involved  in  the  formation  and 
motion  of  the  observed  defects  is  the  vacancy  or 
the  self-interstitial.  Despite  this  fact  and  the  in- 
tensive study  of  deep  point  and  of  line  defects  in 
several  semiconductors  over  many  years,  this 
question  is  still  being  debated  even  for  the  most 
thoroughly  studied  cases.  For  example,  in  the 
case  of  Si,  Refs.  1-6  refer  to  a few  of  the  recent 
papers  in  which  the  respective  authors  interpret 
diffusion  and  formation  of  dislocations  and  stack- 
ing faults  in  terms  of  vacancies,  while  Refs.  7- 
12  refer  to  a few  of  the  recent  papers  in  which 
these  phenomena  are  described  in  terms  of  Si 
self-interstitials.  A similar  situation  exists  in 
GaAs-AlGaAs,  where  some  authors  explain  the 
formation  of  dark- line  defects  in  heterostructure 
lasers  entirely  in  terms  of  vacancies, 11,14  while 
others15'1*  invoke  self- interstitials  to  explain  their 
formation. 

Those  who  believe  that  vacancies  are  the  dom- 
inant native  defects  in  group  IV  and  m-V  semi- 
conductors do  so  for  several  reasons:  (1)  Only 
vacancies  and  vacancy  complexes  are  identified 
by  electron  paramagnetic  resonance  in  these  ma- 
terials even  when  they  have  been  subject  to  elec- 
tron irradiation  and  one  may  be  certain  that  vac- 
ancy-interstitial (Frenkel)  pairs  were  produced.17 
The  absence  of  self- interstitials  in  irradiated 
samples  has  generally  been  ascribed  to  a rapid 
athermal  migration  of  self- interstitials, 11  even 
at  2 K,  and  to  a very  large  heat  of  formation  of 
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(Sec.  in)  the  mechanism  by  which  dislocations 
should  be  expected  to  form  if  indeed  vacancies 
are  much  more  numerous  than  sell- interstitials  in 
these  semiconductors.  It  is  concluded  that,  be- 
cause reconstruction  should  occur  on  interior  sur- 
faces as  it  is  known  to  occur  on  exterior  surfaces, 
the  precipitation  of  vacancies  will  cause  the  form- 
ation of  interstitial-type  dislocations  in  most,  but 
not  all,  experimental  situations.  The  possibilities 
to  verify  this  contention  experimentally  are  dis- 
cussed in  Sec.  IV. 

The  author  is  aware  that  his  proposal  is  at  odds 
with  the  conventional  wisdom  of  the  TEM-disloca- 
tion  community.  He  maintains  that  it  is  a reason- 
able conclusion  from  what  is  generally  accepted 
in  high-vacuum  surface  science  and  the  most  at- 
tractive resolution  of  the  dilemma  of  the  evidence 
for  vacancies  and  for  self- interstitials  where  it 
is  clear  that  they  cannot  both  be  present  in  signif- 
icant concentrations. 

n.  WHAT  AN  “INTERSTITIAL-TYPE" 
DISLOCATION  LOOP  IS 

When  an  electron  microscopist  says  that  a dis- 
location loop  is  "interstitial- type”,  he  means  that 
the  host- lattice  planes  bow  away  from  the  center 
of  the  loop.1*  When  he  says  the  dislocation  loop 
is  "vacancy  type",  he  means  that  the  host-lattice 
planes  bow  toward  the  center  of  the  loop  (see  Fig. 

1).  This  notation  was  popularized  by  Friedel,1* 
who  observed  that  if  a number  of  vacancies  were 
to  condense  to  form  a planar  void  of  sufficient  size, 
then  it  would  be  energetically  favorable  for  the 
material  to  remove  the  void  by  rewelding  the  sur- 
face on  either  side  together.  This  would  remove 
the  surface  energy  of  the  void  over  its  entire  ex- 
tent at  the  cost  of  deforming  the  lattice  inward  only 
around  the  perimeter.  Thus,  it  is  clear  that  for  a 
sufficiently  large  number  of  vacancies  in  a cluster, 
the  rewelding  will  reduce  the  total  energy  of  the 
solid.  It  should  also  be  clear  that  there  will  al- 
ways be  an  activation  barrier  to  be  overcome  be- 
fore this  rewelding  can  occur,  because  the  oppo- 
site planes  must  be  brought  together.  Further- 
more, if  foreign  atoms,  such  as  H,  He,  or  Ar, 
in  the  presence  of  which  crystals  are  often  grown, 
precipitate  into  the  void,  they  may  prevent  the  re- 
welding from  occuring. 

It  should  be  emphasized  that  the  electron  micro- 
scope detects  the  bowing  of  the  lattice  planes  rather 
than  the  presence  or  absence  of  host  atoms.  In  the 
absence  of  lattice  distortion,  a vacancy  would  have 
a scattering  potential  lor  electrons  differing  from 
that  of  an  interstitial  of  the  same  atomic  species 
only  by  a sign.1*  (The  potential  being  that  of  the 
atom.)  As  the  image  intensity  is  proportional  to 
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FIG.  1.  Schematic  representation  of  the  conventional 
view  of  the  conversion  of  a cluster  of  vacancies  (a)  into 
a “vacancy-type”  dislocation  loop  (b)  by  a rewelding  of 
the  surfaces  on  either  side  of  the  void.  (These  figures 
are  adapted  from  Friedel,  Ref.  28).  In  (c)  we  see  an 
“interstitial-type”  dislocation  loop  at  which  the  host- 
lattice  planes  bow  away  from  the  center  of  the  loop  as 
would  be  expected  If,  for  example,  self-interstitials 
were  to  precipitate. 


the  square  of  the  electronic  wave  function,  the 
image  contrast  produced  by  vacancies  is  the  same 
as  that  produced  by  interstitial  atoms.  While  it 
is  true  that  the  electron  beam  is  less  attenuated 
in  passing  through  vacancies  than  in  passing 
through  interstitials,  the  difference  of  one  atomic 
layer  more  or  less  in  a sample  typically  1 jim 
thick  is  generally  not  detectable  and  would  be 
masked  by  steps  and  adsorbed  layers  on  the  sur- 
face of  the  sample. 

Now,  it  has  long  been  recognized  that  the  fact 
that  a dislocation  loop  is  determined  to  be  "inter- 
stitial-type” does  not  imply  that  it  was  formed  by 
a precipitation  of  self-interstitials.1  Indeed,  four 
alternative  processes  have  been  firmly  established 
in  the  semiconductor  literature.  One  alternative 
process,  which  has  been  widely  observed  in  Si,  is 
the  conversion  of  inters titially  diffusing  impur- 
ities (typically  Cu,  Au,  or  LI)  into  substitutional 
impurities  by  absorption  into  interstitial  loops.30'33 
A second  process  is  the  emission  of  vacancies  by 
an  interstitial  loop  initially  present M,M  The  inter- 
stitial loop  grows  by  one  atom  for  each  vacancy 
that  is  emitted  from  its  boundary  and  diffuses 
away.  In  a third  mechanism.33  a precipitate  par- 
ticle (e.g.,  Cu  or  Fe  in  Si5’*  )is  formed  at  an  edge 
dislocation.  Compressive  stress  inside  the  pre- 
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cipitate  is  relieved  by  flow  of  vacancies  from  the 
dislocation  to  the  precipitate  particle.  This  flow 
may  talte  place  by  dislocation  pipe  diffusion  or 
volume  diffusion  and  is  accompanied  by  dislocation 
climb.  The  sense  of  the  climb  is  such  that  an  inter- 
stitial loop  will  grow  and  a vacancy  loop  will 
shrink.5’*’3*’3*  A fourth  mechanism  is  prismatic 
punching,”*3*  which  is  only  rarely  observed  in 
semiconductor  crystals  that  have  not  been  mech- 
anically damaged.3* 

There  is  a fifth  alternative  mechanism  which  has 
not  been  much  discussed  in  the  literature.  Be- 
cause these  semiconductors  expand  on  solidifica- 
tion, any  inclusion  of  the  liquid  phase  that  may 
be  trapped  in  the  solid  during  crystal  growth 
will  tend  to  punch  out  interstitial- type  disloca- 
tions when  it  does  freeze  .*°’*t  One  expects  in- 
clusions to  occur  when  the  crystal  is  growing  by 
means  of  reentrant  growth  steps.  When  the  growth 
steps  are  more  than  a few  atomic  layers  high 
(on  covalent  crystals  they  are  often  several  hun- 
dreds of  layers  high),  one  would  expect*3*43  to  find 
reentrant  growth  steps  even  if  the  treads  of  the 
growing  surfaces  of  the  crystal  do  not  reconstruct. 
The  tendency  to  form  reentrant  growth  steps 
would  be  much  increased  if  these  treads  do  re- 
construct.*1 

This  filth  alternative  mechanism  has  the  attrac- 
tion that  it  would  provide  a mechanism  for  the  nu- 
cleation  of  dislocations  in  crystals  of  such  high 
purity  that  impurity  induced  heterogeneous  nuclea- 
tion  is  not  effective.  It  has  been  noted  that  the 
homogeneous  nucleation  of  a dislocation  from  a 
concentration  of  single  vacancies  that  was  at 
equilibrium  with  the  surface  during  crystal  growth 
is  normally  an  extremely  unlikely  event.*4'*9  If 
one  assumes  the  self-interstitial  is  the  dominant 
native  lattice  defect,  the  same  considerations  as 
presented  in  Refs.  44  and  45  show  that  the  homo- 
geneous nucleation  of  an  interstitial  dislocation  is 
equally  unlikely.  It  is  usually  concluded  that  al- 
most all  observed  dislocations  were  heterogene- 
ously nucleated  by  impurity  precipitates,  but  many 
semiconductors  are  grown  so  pure  that  this  assump- 
tion should  be  questioned. 

01.  RECONSTRUCTION  ON  INTERNAL  SURFACES 

Reconstruction  reactions  are  first-order  phase 
transitions  that  are  observed  (in  high  vacuum)  on 
most  low  index  [e.g.,  (100),  (110),  and  (111)]  sur- 
faces of  group  IV  and  m-V  semiconductors  by  low- 
energy-electron  diffraction  (LEED),  photoemis- 
sion (PE),  and  other  techniques.4**53  It  is  general- 
ly agreed  that  these  reactions  occur  because  co- 
valently bonded  crystals  can  lower  their  free  en- 
ergy by  removing  the  dangling  bonds  on  the  ideal 


surface  (i.e.,  that  obtained  by  simply  truncating 
the  bulk  crystal  structure)  by  switching  to  a new 
crystal  structure  for  the  surface  layer  of  atoms . 
Whereas  the  atoms  in  the  bulk  of  these  semicon- 
ductors engage  in  s-p 3 hybridized  bonding,  those 
on  the  surface  may  engage  in  s-p1  hybridized  bond- 
ing or  p bonding  or  some  other  type  of  bonding  in 
order  to  minimize  their  free  energy. 

The  first  lemma  of  the  present  hypothesis  is 
that  similar  reconstruction  reactions  will  occur 
on  internal  surfaces  of  a crystal  that  are  formed 
by  the  precipitation  of  vacancies  at  voids,  vacan- 
cy complexes,  or  dislocations,  however  nucleated. 
As  long  as  the  opposite  sides  of  such  cavities  re- 
main sufficiently  far  apart  that  the  interaction  be- 
tween them  is  slight,  these  reconstructions  will 
surely  obtain.  In  order  to  gauge  how  great  a sep- 
aration between  surfaces  is  sufficient,  we  note 
that  surface  energies  result  from  the  spilling  of 
electron  density  into  vacuum,  where  it  cannot  be 
compensated  by  atomic  cores,  which  occurs  be- 
cause the  electronic  wave  functions  are  not  ter- 
minated abruptly  at  a surface. 33’35’5**,T  This  spill- 
ing of  the  charge  density  will  be  approximately 

p(x)-p(0)exp(-kax),  (1) 

where  kt  is  the  linearized  Thomas- Fermi  screen- 
ing wave  number.  The  important  point  is  that,  be- 
cause the  valence  electron  density  of  these  semi- 
conductors is  very  high,  kt  is  large  and  the  spilled 
charge  density  falls  off  rapidly.23’25’57  Indeed,  for 
Si 

M»*  35,  (2) 

where  Rw  is  the  Wigner-Seitz  radius.  Therefore, 
as  long  as  the  opposite  surfaces  of  an  internal  cav- 
ity remain  at  least  one  atomic  spacing  apart,  there 
is  negligible  interaction  between  them.  In  fact, 

2 Rw  is  a conservative  estimate  of  the  distance 
between  opposite  surfaces  because  atoms  move 
away  from  the  vacuum  and  toward  the  bulk  im- 
mediately upon  formation  of  the  surface,  before 
any  further  reconstruction  can  occur,  and  also 
around  single  vacancies.  This  is  simply  because 
charge  is  redistributed  from  the  dangling  bonds 
of  these  surfaces  to  the  back  bonds  causing  them 
to  shorten.13’25’ 53’5* 

It  should  be  noted  that  a minimum  extent  of  sur- 
face area  is  required  before  reconstruction  can 
occur.  Empirically,9*  the  minimal  dimension 
De  is  typically  6-8  nm.  The  rate  at  which  the  re- 
construction reaction  occurs  increases  with  increas- 
ing extent  of  the  surface. 51,40  This  variation  has  been 
shown  to  fit  the  hypothesis  that  once  the  reconstruc  - 
tion  reaction  starts  on  any  tread  of  a stepped  surface, 
it  propagates  rapidly  across  the  entire  tread  to  the  ri- 
sers.90 This  fact  has  been  explained*1’  “•“by  the  hy- 
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FIG.  2.  Schematic  representation  of  the  author’s 
proposal  of  a mechanism  for  the  conversion  of  a planar 
precipitate  of  vacancies  into  an  “interstitial-type"  dis- 
location loop  by  the  reconstruction  of  the  surfaces  of 
the  cavity,  to  (a)  we  again  see  the  cavity  formed  by 
the  precipitation  of  vacancies,  as  in  Fig.  1(a).  to  (b) 
both  surfaces  of  the  cavity  are  undergoing  the  surface 
reconstruction  reaction  which  drives  a wave  of  atoms, 
that  are  expelled  from  the  surfaces  by  the  reaction.  Into 
the  extremities  ofthecavity.  These  extra  atoms  in  the 
extremities  are  thought  to  be  in  a state  similar  to  that  of 
the  liquid  phase.  The  reconstructed  surface  is  a phase 
of  lower  energy,  lower  density,  and  different  bonding. 

In  (c)  the  atoms  that  were  expelled  by  the  reconstruction 
have  recrystallized  to  the  bulk  crystal  structure.  As 
they  do  so,  they  expand,  as  does  the  liqutd  phase,  so 
that  an  interstitlal-typedlslocation  loop  is  formed,  the 
surfaces  of  the  original  cavity  are  forced  apart  by  one 
lattice  spacing,  and  a pillar  of  good  crystalline  material 
is  left  to  stabilize  the  cavity  and  the  dislocation. 


pothesis  that  reconstructed  surface  phases  generally 
contain  fewer  atoms  than  the  ideal  (unreconstructed) 
surface  phase  t4»>«7>**-”  so  that  the  reaction  releases 
atoms  and  drives  them  in  a reaction  front  that  per- 
turbs the  metastable  ideal  phase  and  thus  propagates 
the  reaction  further.  When  this  occurs  on  an  exterior 
surface  of  a crystal,  one  would  expect  the  waves  of 
released  atoms  to  collide  and  produce  hillocks. 
This  effect  has  evidently  been  observed;  the  hill- 
ocks on  Si  being  typically  40  nm  high.”  It  pro- 
vides an  explanation  for  the  continuous  source  of 
steps,  that  is  required  for  crystal  growth,  in  the 
absence  of  screw  dislocations. “'*l 
Although  the  structure  and  coordination  of  the 
released  atoms  as  they  propagate  in  the  wave  has 
not  been  determined,  it  is  here  assumed  that,  as 


in  the  liquid  phase,  these  atoms  are  more  densely 
packed  than  in  the  tetrahedrally  coordinated  solid 
phase.  It  would  seem  that  if  the  atoms  were  tetra- 
hedrally coordinated  in  the  covalent  solid  phase , they 
would  not  be  able  to  migrate  ac  ross  the  surface  as  re- 
quired to  explain  the  kinetics  of  the  reconstruction  on 
stepped  surfaces.51*”  inanycase,  the  tetrahedral 
phase  is  the  least  dense  of  all  known  condensed  phases 
of  these  compositions.  This  point  is  important  be- 
cause it  implies  that  these  atoms  will  expand , as  does 
a liquid  drop,  when  they  recrystallize  to  the  bulk  solid 
phase. 

The  second  lemma  of  the  present  hypothesis  is 
the  contention  just  introduced  that  reconstruction 
releases  atoms  from  the  ideal  surface,  that  these 
will  migrate  in  a wave  across  the  flat  surface  un- 
til stopped  for  some  reason  whereupon  they  solid- 
ify to  the  bulk  crystal  structure  and  expand  to 
produce  a hillock,  pillar,  or  similar  surface  de- 
fect. 

Now  let  us  consider  what  should  happen  as  vacan- 
cies precipitate  in  the  interior  of  a covalent  crys- 
tal. It  is  well  known  that  there  is  a binding  energy 
of  order  1 eV  between  vacancies57’85’*4  so  that  they 
will  form  multiple  vacancy  clusters  (small  voids) 
as  the  crystal  cools  from  its  growth  temperature. 
Suppose  that  an  approximately  disk- shaped  cluster 
(cavity)  has  grown  to  a size  sufficient  to  have  a 
low  index  surface  greater  than  the  critical  extent, 
6-8  nm,  required  for  reconstruction59  without 
rewelding  (see  Fig.  2).  When  the  reconstruction 
occurs,  it  produces  a wave  of  atoms  which  pro- 
pagates through  the  void  across  the  interior  sur- 
face. The  wave  will  stop  when  it  collides  with 
another  wave,  the  end  of  the  cavity,  or  perhaps 
a patch  of  reconstructed  surface.  Thereupon, 
the  atoms  solidify  into  the  bulk  solid  phase.  Ob- 
viously, an  internal  hillock  or  pillar  produced 
inside  the  cavity  of  precipitated  vacancies  could 
not  be  as  high  as  the  40-nm  ones  on  exterior  sur- 
faces. One  might  instead  suppose  these  atoms  re- 
crystallize as  would  a drop  of  the  liquid,  expand- 
ing as  it  solidifies  and  tending  to  drive  the  oppo- 
site surfaces  of  the  cavity  apart  by  one  or  a few 
atomic  spaclcgs.  If  this  does  occur,  then  an  “in- 
terstitial-type” dislocation  loop  is  produced  in 
which  the  “extra  plane”  of  lattice  sites  contains 
regions  where  host  atoms  are  present  and  regions 
where  they  are  not  (see  Fig.  3).  Where  the  atoms 
are  not  present,  the  opposite  surfaces  are  now  re- 
constructed, so  that  there  are  no  dangling  bonds 
and  there  is  a much  larger  activation  barrier 
against  rewelding. 

Several  questions  regarding  this  hypothesis  come 
to  mind:  (1)  Is  the  proportion  of  pillar  regions  to 
void  regions  likely  to  be  sufficient  to  stabilize  the 
dislocation  loop?  (ii)  Is  the  Burgers  vector  of  the 


the  energy  released  by  reconstructing  the  inter* 
nal  surface  greater  than  that  consumed  producing 
the  dislocation  ? (vi)  Under  what  circumstances 
would  vacancy  type  dislocations  be  expected  in- 
stead? Let  us  consider  these  questions  in  order. 

(i)  The  generally  accepted,  Lander-Morrison 
model  for  the  stable  (7  x 7)  reconstruction  of  the 
(111)  surfaces  of  Si  contains  28%  fewer  atoms 
than  the  ideal  surface.4*'"  As  there  are  two  such 
surfaces  to  the  vacancy  produced  cavity,  52  atoms 
would  be  released  for  every  100  vacancies  pre- 
cipitating into  a disk- shaped  cavity  one  atomic 
spacing  thick.  Let  us  suppose  for  the  moment 
that  the  Burgers  vector  of  the  dislocation  is  the 
same  as  the  orientation  of  the  reconstructed  sur- 
face. If  the  host  lattice  is  forced  apart  one  atomic 
layer  to  form  a dislocation  loop  as  these  atoms 
solidify,  then  26%  of  the  extra  plane  of  lattice 
sites  would  be  occupied,  i.e.,  26%  of  the  region 
would  be  pillar  and  74%  void.  If  the  initial  cav- 
ity is  a disk  two  atomic  spacings  thick,  26  atoms 
would  be  released  for  every  100  vacancies  pre- 
cipitating and,  if  the  host  lattice  is  forced  apart 
one  spacing  (so  there  are  3 layers  where  there 
were  2)  to  form  the  dislocation,  then  15.3%  of  the 
loop  would  be  pillar.  If  the  initial  cavity  is  a disk 
three  atomic  spacings  thick,  still  13.8%  of  the  re- 
sultant dislocation  loop  would  be  pillar.  This 
case  is  typical  of  the  stable  reconstructions  that 
occur  on  the  low  index  surface  of  most  semicon- 
ductors.”'53 It  would  seem  that  such  proportions 
of  pillar  to  void  would  be  adequate  to  stabilize  the 
loop. 

(ii)  It  seems  clear  that  the  Burgers  vector  of  the 
dislocation  need  not  be  the  same  as  the  orientation 
of  the  cavity  surface  which  reconstructs.  If  a 
liquid  inclusion  were  to  solidify  and  to  punch  out 
dislocations,  the  resultant  Burgers  vectors  would 
be  determined  by  properties  of  the  host  as  well  as 
by  the  shape  of  the  inclusion.  Moreover,  after 
the  dislocation  is  nucleated,  its  Burgers  vector  may 
be  rotated  by  low-energy  processes,”  such  as  re- 
action with  Shockley  partial  dislocations. 

(iii)  In  addition  to  the  dislocations  that  may  be 
nucleated  by  the  action  of  impurities, 30’3*  some 
interstitial-type  dislocations  would  be  nucleated 
in  pure  semiconductor  crystals  grown  from  a 
melt,  which  is  more  dense,”  by  the  trapping  of 
liquid  inclusions  by  reentrant  (overhanging)  growth 
steps  that  are  many  atoms  high.41*43  Moreover, 

it  is  here  proposed  that  the  precipitation  of  vacan- 
cies-interior  reconstruction  mechanism  will  it- 
self nucleate  interstitial- type  dislocations  due  to 
the  expansion  of  the  released  atoms  as  they  re- 
crystallize, and  that  it  will  do  so  without  re- 
quiring that  the  vacancy  cluster  first  assume 
the  unlikely  shape  of  a disk  one  atomic  layer 


FIG.  3.  Proposed  process,  as  in  Fig.  2,  viewed  nor- 
mal to  the  plane  of  the  cavity  and  of  the  loop,  lb  (a)  we 
see  the  original  disk-shaped  cavity  produced  by  the 
vacancies  prior  to  Its  recoostructloo.  in  to)  the  cavity 
has  reconstructed  and  formed  a pillar  between  the  dis- 
location loop  and  the  reconstructed  cavity  from  the  atoms 
expelled  by  the  reconstruction  reaction.  In  (c)  more 
vacancies  have  precipitated  out  of  the  bulk  to  form  a new 
unreconstructed  void  at  the  perimeter  of  the  first  dislo- 
cation loop.  The  vacancies  are  attracted  to  this  region 
by  the  tensile  strain  field  there.  They  do  not  annihilate 
the  pillar  because  of  the  compressive  strain  field  in  that 
region.  In  (d)  the  new  cavity  has  undergone  reconstruc- 
tion producing  more  pillar  region  and  increasing  the 
dislocation  loop. 


dislocation  necessarily  the  same  as  the  orienta- 
tion of  the  cavity  surface  which  reconstructed? 
(iii)  How  about  the  problem  of  nucleating  such  a 
dislocation  in  the  first  place?  (iv)  If  such  a dis- 
location were  nucleated,  would  it  grow?  (v)  Is 
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thick.**’"  Suppose  that  the  vacancies  in  Si  precip- 
itate  to  an  octahedral- shaped  cavity,  which  is 
the  equilibrium  shape  of  a cavity  in  Si.  The  di- 
mensions of  the  (111)  surfaces  of  this  cavity  ex- 
ceed the  minimal  dimension  for  reconstruction" 
(about  6 nm)  when  about  5.1  x 103  vacancies  pre- 
cipitate into  it.  There  will  then  be  1.1  x 103  atoms 
on  the  8 surfaces  bounding  this  cavity.  If  all  sur- 
faces reconstruct,  2.9  x 10*  atoms  will  be  released 
in  the  wave  of  the  reaction  front.  Suppose  that, 
before  recrystallization,  these  atoms  form  a single 
drop  in  one  corner  of  the  octahedron  so  as  to  min- 
imize their  surface  energy.  It  seems  clear  that 
recrystallization  will  begin  where  the  drop  contacts 
the  surfaces  of  the  cavity  and  will  next  occur  on  the 
free  surface  of  the  drop.  As  with  the  freezing  of 
ice  cubes , the  interior  of  the  drop  will  be  the  last 
to  recrystallize.  If  the  drop  does  expand  about 
15%  in  volume  when  it  recrystallizes,  as  does  the 
liquid  phase,"  it  will  stress  the  surrounding  lat- 
tice. One  of  the  ways  in  which  this  stress  can  be 
relieved  is  by  the  formation  of  an  interstitial  dis- 
location loop  sufficient  to  contain  about  30  atoms 
in  its  extra  partial  plane.  Obviously  this  config- 
uration would  not  be  a state  of  minimum  energy , 
but  it  is  well  known  that  dislocations  are  not  equil- 
ibrium defects;  they  would  never  exist  were  the 
crystal  at  equilibrium.” 

(iv)  We  now  suppose  that  an  interstitial- type  dis- 
location loop  has  been  nucleated  by  one  of  the  above 
mechanisms  and  that  vacancies  diffuse  to  it.  It 
is  commonly  stated  that  vacancies  will  annihilate 
such  a dislocation.  However,  it  is  well  known 
that  impurity  atoms  form  precipitates  on  and  about 
dislocations  as  the  crystal  cools  and  their  con- 
centrations become  supersaturated.  Vacancies 
will  generally  not  annihilate  such  precipitates  be- 
cause to  do  so  would  require  redissolving  the  im- 
purity atom  into  the  solid  from  which  it  had  pre- 
cipitated. Vacancies  should  not  be  expected  to  an- 
nihilate the  dislocation  produced  by  or  to  accom- 
modate such  precipitates  either.  In  those  central 
portions  of  the  loop  where  the  extra  partial  plane 
is  composed  of  host  atoms,  the  arrival  of  vacan- 
cies from  the  bulk  will  not  remove  any  lattice 
sites  from  the  dislocation,  l.e.,  annihilate  it,  un- 
less the  vacancies  nucleate  a disk-shaped  cavity 
and  reweld  the  opposing  surfaces,  which,  as  noted 
above,  is  a very  unlikely  event.*4’43  Evidently, 
there  would  be  little  attraction  of  vacancies  to 
such  regions  of  the  loop  and  they  would  diffuse 
on  through  the  bulk.  In  those  central  portions 
of  an  “interstitial- type”  dislocation  loop  formed 
by  the  vacancy  precipitation-surface  reconstruc- 
tion mechanism  where  the  "extra  partial  plane” 
is  in  fact  a cavity  with  reconstructed  surfaces, 
the  arrival  of  a vacancy  from  the  bulk  would  mere- 


ly convert  that  bulk  vacancy  into  a vacancy  in  the 
reconstructed  surface  phase.  The  energy  of  form- 
ation of  vacancies  in  the  reconstructed  surface 
phase  is  still  undetermined,  but  it  is  certain  to 
be  greater  than  that  in  the  unreconstructed  sur- 
face phase,  where  it  is  of  order  75%  that  in  the 
bulk.41’30  Therefore,  there  may  be  little  attrac- 
tion for  vacancies  to  such  regions.  If  so,  they 
may  not  stay  there.  If  they  do  persist  on  the  re- 
constructed surfaces,  they  will  eventually  remove 
the  layer  of  surface  phase  exposing  the  bulk  phase 
bu  aath.  When  a sufficient  extent  of  this  ideal 
surface  phase  is  exposed,  it  will  reconstruct  re- 
leasing more  atoms  and  causing  the  dislocation 
and/or  the  pillar  regions  within  it  to  grow.  In 
either  case,  it  seems  that  the  arrival  of  vacan- 
cies will  not  cause  these  interstitial-type  disloca- 
tion loops  to  dissolve.  Thus,  it  is  only  on  the  per- 
imeter of  the  loop  at  sites  occupied  by  host  atoms 
that  there  is  any  danger  of  vacancies  annihilating 
the  dislocation  loop. 

Let  us  now  consider  whether  the  arrival  of  va- 
cancies from  the  bulk  to  the  perimeter  of  the  loop, 
the  core  of  the  dislocation,  should  cause  the  loop 
to  shrink  or  to  expand.  Without  knowing  anything 
about  the  disposition  of  atoms  in  the  core,  which 
is  not  firmly  established  for  these  dislocations, 
we  may  be  sure  that  for  the  present  case  of  “in- 
terstitial-type" loops,  the  region  immediately 
inside  the  perimeter  is  strained  compressively  and 
that  the  region  immediately  outside  the  perimeter 
is  subject  to  large  tensile  strain.  In  order  to 
reach  the  interior  of  the  loop,  vacancies  would 
have  to  migrate  through  the  regions  of  tensile 
strain.  However,  the  vacancies  will  tend  to  re- 
main and  to  cluster  in  the  region  of  highest  ten- 
sile strain  because  the  energy  of  vacancy  formation 
AH/(V)  is  a decreasing  function  of  the  interatomic 
spacing  r,  and  so  is  less  in  regions  of  tensile 
strain.  This  may  be  understood  by  using  the  ap- 
proximation of  the  single  vacancy  as  a macro- 
scopic cavity*3’13’”’**  and  estimating 

ZHf(V)=Ao,  (3) 

where  A is  the  surface  area  of  the  cavity  and  o 
is  the  energy  per  unit  area  of  the  cavity  surface. 
(Because  the  valence  electron  density  is  so  high, 
as  noted  at  Eq.  (2),  this  is  a rather  good  approxi- 
mation in  semiconductors,*3’*3’37  although  it  is  not 
very  accurate  for  many  metals.**)  As  A is  pro- 
portional to  r*  while  o is  proportional3*  to  r*3/a, 

Eq.  (3)  implies 

AUf(  V) » constr*1  n.  (4 ) 

One  may  also  wish  to  consider  the  effect  of  break- 
ing the  covalent  bonds.  Several  authors***71  have 
analyzed  the  bond  bending  and  bond  stretching 
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force  constants  between  atoms.  From  these  an- 
alyses, and  from  the  observed  signs  of  the  third- 
order  elastic  constants,  it  is  clear  that  the  inter- 
atomic forces  are  decreasing  functions  of  r.  (One 
might  also  consider  the  chemical  trends  in  the 
commonly  accepted  experimental  values23’*5' 57,72 
for  which  are  greater  for  C than  for  Si 

and  greater  for  Si  than  for  Ge,  etc.)  Of  course, 
some  of  the  vacancies  will  pass  through  the  regions 
of  tensile  strain  to  the  interior  perimeter  ' the 
loop  despite  this  repulsion  due  to  the  strain  field 
there.  But  before  the  compressive  stress  can  be 
removed  and  the  energy  of  the  crystal  reduced 
(as  assumed  by  those  who  assert  that  vacancies 
will  annihilate  interstitial-type  dislocations),  it 
would  be  necessary  to  cluster  a sufficient  number 
of  vacancies  in  the  interior  perimeter  so  that  part 
of  the  extra  partial  plane  there  could  be  removed 
by  the  mechanism  of  rewelding  of  the  surfaces  of 
a disk- shaped  cavity.  Evidently,  the  probability 
of  such  an  event  in  this  case  is  even  less  than  in 
regions  without  compressive  strain. 

Thus,  we  come  to  the  conclusion  that  as  vacan- 
cies arrive  in  the  vicinity  of  the  dislocation  loop 
they  will  cluster  predominately  in  the  regions  of 
highest  tensile  strain  outside  the  loop.  As  they 
do  so,  they  will  produce  extended  cavities.  The 
shape  of  these  cavities  will  be  influenced  by  the 
strain  field  outside  the  dislocation.  While  they 
will  probably  not  be  disks  of  thickness  one  atomic 
layer,  they  will  probably  also  not  have  the  equili- 
brium shape  of  cavities  in  the  host  lattice  either. 
Consequently,  when  these  cavities  reach  the  criti- 
cal extent,  they  will  reconstruct  releasing  more 
atoms  that  will  cause  the  dislocation  to  grow  so 
that  the  process  may  be  repeated  until  the  super- 
saturation  of  vacancies  is  depleted. 

(v)  Referring  to  Fig.  3(a),  we  may  estimate  the 
energy  of  an  initial  approximately  disk- shaped 
cavity  before  reconstruction  as 

E1a2tR*oa,  (5) 

where  Rx  is  the  radius  of  this  cavity  and  ax  is  the 
energy  per  unit  area  of  the  unreconstructed  sur- 
face. When  reconstruction  occurs,  the  surface 
energy  is  reduced  to  cr,  and  the  radius  of  the  void 
region  is  reduced  to  Rr  because  a fraction  of  the 
atoms  released  from  the  surface  fill  up  part  of 
the  cavity.  The  energy  of  the  cavity  plus  disloca- 
tion loop  defect  in  Fig.  3(b)  may  be  estimated73  as 

E2^2nR\<JT-yitnRJ>2{\a(R4/b)~  l]/2(l-  v),  (6) 

where  Rt  is  the  radius  of  the  dislocation  loop,  p 
)s  the  shear  modulus  of  the  host  crystal  (in  terms 
of  the  elastic  constants,74  n = Cu-f. f/5,  where 
# = 2C44  + Cia-Cu),  6 is  the  Burgers  vector  of  the 
dislocation,  and  v is  Poisson’s  ratio.  Clearly, 


El>Ei  for  R,  greater  than  some  critical  value, 
Ruc  or  for  j i less  than  a critical  value,  <ie.  For 
the  case  of  Si  at  room  temperature,  the  shear 
modulus  is70  n(RT)  = 6.81  x 10“  erg/ cm3,  while75 
6 =0.384  nm,  and  y = 0.215.  Also,  for  Si(lll) 
surfaces,25  <jk=  904  erg/cm2.  The  value  of  ar  is 
not  known  empirically  but  can  be  estimated25  by 
noting  that  the  reconstructed  surface  is  about  26% 
less  dense  than  the  unrecons  rue  ted  surface  and 
is  free  of  dangling  bonds.  This  estimate  is  <Jr 
= 613  erg/cm2.  If  we  further  suppose  that  the  pil- 
lar regions  discussed  in  (i)  lie  entirely  within  the 
original  cavity,  we  estimate 


(7) 

and 

Rr  = (1.0  — 0.26)l/2R>  = 0.86R> . 

(8) 

Let  us  further  assume  that  the  cavity  reconstructs 
as  soon  as  it  attains  the  minimum  extent,59  V =6- 

8 nm,  necessary  for  reconstruction, 
the  minimum  empirical  value  for  Oc, 

Then,  taking 

Rc-kDc- 3 nm, 

(9) 

one  calculates, 

= 5.11  x 10*‘°  erg, 

(10) 

while 


Ej  = 2.56  x 10*‘°  + 2.02  x lO-10  = 4.58  x 10' 10  erg. 

(ID 

Therefore,  with  Si  at  room  temperature,  any 
cavity  that  is  large  enough  to  reconstruct  can  do 
so  with  sufficient  energy  to  produce  the  disloca- 
tion loop  indicated  in  Figs.  2 and  3.  Thus, 

/e«(Si)  = 0, 

and  (12) 

Me(Si)  = 8.6x  10“  erg/cm3. 

However,  as  this  minimal  excess  energy  is  only 
about  10%  of  El , the  formation  of  the  dislocation 
may  be  an  improbable  event  in  such  cases.  If  the 
cavity  reconstructs  without  producing  a dislocation, 
the  defect  will  produce  little  lattice  strain  and 
probably  would  not  be  detected  by  TEM  unless  it 
becomes  decorated  with  impurities,  in  which 
case  it  would  appear  as  a precipitate. 

As  one  goes  to  higher  temperatures,  the 
temperature  variation  of  must  be  considered. 

The  author  has  found  empirical  values  of  d^/dT 
only  for  room  temperature  and  below.7®  However, 
it  can  be  calculated  from  theories  relating  the 
elastic  constants  to  the  electronic  structure71  and 
a knowledge  of  the  temperature  dependence  of  the 
electronic  structure.77’7*  (This  theory  fits  the 
room- temperature  temperature  dependence  to 
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about  20%. ) This  leads  to  the  estimate  for  Si  at 
its  melting  point  Tm, 

M(Si,rj=0.8/i(Si,RT)  = 6.45x10"  erg/cm3.  (13) 

Neglecting  thermal  expansion  of  6 and  any  temp- 
erature variation  of  <rm  and  <rr,  this  leads  to  the 
estimate  that  £,  exceeds  £,  by  about  20%  for  the 
minimum  sized  cavity  that  could  reconstruct  at 

Tm- 

In  the  case  of  GaAs  at  room  temperature,  the 
shear  modulus  is70  4 =4.85  xlO11  erg/cm3,  while7* 
i/  = 0.29  and  6 = 0.399  nm,  and25  <rB  = 812  erg/cm3 
and  <7,  = 495  erg/cm3.  With  ft,  = 3 nm,  the  mini- 
mum value  possible  for  reconstruction,  a calcula- 
tion as  above  leads  to 

£,(GaAs,RT,3  nm)  = 4.59  x 10*l°  erg,  (14) 
while 

£2  = 2 .07  x 10**°  + 1 .66  x 10*‘°  = 3 .73  x 10* 10  erg. 

(15) 

Again  we  conclude  that  any  cavity  which  can  re- 
construct would  do  so  with  sufficient  energy  to  pro- 
duce the  sort  of  dislocation  loop  here  hypothesized 
(Figs.  2 and  3).  However,  the  minimum  energy 
release  is  only  about  19%  so  that  again  it  is  prob- 
able that  the  dislocation  loop  will  not  always  be 
formed. 

(vi)  Finally,  we  consider  the  question  of  the 
conditions  under  which  one  ought  to  expect  the 
precipitation  of  vacancies  to  produce  “vacancy- 
type”  dislocations  loops  in  semiconductors.  This 
question  is  relevant  because  there  are  a few  ob- 
servations80’81 of  vacancy-type  dislocations  in  these 
materials  where  most  dislocations  are  observed 
to  show  interstitial-type  lattice  bowing  in  the  elec- 
tron microscope.  The  important  point  here  would 
seem  to  be  that  reconstruction  is  a thermally  ac- 
tivated process  .41’80’ 81  It  would  seem  that  reweld- 
ing should  occur  without  an  activation  barrier  if 
unreconstructed  surfaces  (with  dangling  bonds  in 
compatible  orientation)  are  mechanically  forced 
together  by  any  stress  field  that  happens  to  be 
present.  If  the  dislocation  is  nucleated  at  too  low 
a temperature,  the  reconstruction  mechanism  will 
not  be  competitive  against  rewelding.  Although 
one  would  have  to  consider  the  details  of  a par- 
ticular case  to  make  an  accurate  estimate,  a crude 
assumption,  that  the  critical  temperature  Tc, 
dividing  the  regimes  of  nucleation  of  vacancy-type 
and  of  interstitial- type  dislocation  loops  is  that 
for  which  the  mean  time  to  reconstruction  is  600 
sec,  leads4*’80  to  the  estimate  that  Te(Sl)  = 390°C 
and  that  Te(GaAs)  = 380 °C. 

Once  the  loop  has  been  nucleated  and  has  started 
to  grow  as  one  type  or  the  other,  on”  should  expect 


it  to  continue  to  grow  as  that  same  type.  In  the 
case  of  an  “interstitial- type”  loop,  this  follows 
because  of  the  tensile  strain  field  outside  the 
perimeter,  where  the  vacancies  would  tend  to 
cluster  and  permit  the  lattice  planes  on  eiiher 
side  of  the  extra  partial  plane  to  separate  further 
apart.  In  the  case  of  a "vacancy- type”  dislocation, 
this  would  follow  because  the  vacancies  would  tend 
to  cluster  on  the  outside  perimeter  of  the  loop, 
where  the  atoms  are  not  well  bonded  (Fig.  1)  so 
that  Is  small,  and  would  permit  the  host- 

lattice  planes  on  each  side  of  the  missing  partial 
plane  to  come  together  and  reweld.  Note  that  in 
both  cases  vacancies  will  be  approaching  the  core 
of  the  dislocation  predominantly  from  the  outside 
of  the  loop  because  the  dislocation  is  gettering 
them  from  the  bulk  of  the  crystal. 

The  distinction  just  made  may  serve  to  resolve 
the  discrepancy  between  the  reports  by  Klmerling 
et  al.li  and  by  O’Hara  et  al .*8  that  dark  line  de- 
fects (DLD’s)  in  GaAlAs  lasers  are  a convolution 
of  interstitial- type  dislocations,  and  the  report 
by  Woolhouse  et  al. 80  that  the  dislocations,  at  least 
in  optically  degraded  material,  are  in  fact  of  vac- 
ancy type.  The  former  two  groups  studied  DLD’s 
which  formed  about  dislocations  that  were  produced 
during  crystal  growth  and  thus  were  nucleated 
while  the  crystal  was  hot  enough  for  reconstruction 
to  occur  in  times  of  order  10  msec.  Woolhouse 
et  al.  studied  DLD’s  which  formed  about  disloca- 
tions that  were  introduced  into  the  sample  by 
scratching  its  surface  at  room  temperature  and 
causing  it  to  glide  by  optical  excitation.  The  mean 
time  to  reconstruction  over  the  activation  barrier 
at  room  temperature  is  calculated  to  be41  or  order 
10*2  years. 

IV.  PROPOSED  EXPERIMENTS  THAT  MAY  VERIFY 
THE  MECHANISM 

Although  the  mechanism  proposed  here  has  the 
attraction  that  it  would  serve  to  resolve  the  funda- 
mental question  of  the  nature  of  deep  point  and  line 
defects  by  explaining  the  general  observation  of 
extrinsic  stacking  faults  and  “interstitial- type” 
dislocations  without  invoking  any  self- interstitials, 
and  thus  contradicting  the  general  conclusion  that 
vacancies  are  far  more  numerous,  it  should  be 
subjected  to  further  scrutiny. 

One  fairly  clear  distinction  between  this  mech- 
anism and  the  alternative  that  the  dislocations  are 
formed  by  the  precipitation  of  self- interstitials 
that  were  incorporated  during  crystal  growth  is 
that  the  present  mechanism  implies  the  presence 
of  voids  in  the  crystal  taking  up  somewhat  fewer 
lattice  sites  than  the  initial  number  of  vacancies, 
i.e.,  of  order  10**-1018  per  cm3,  while  the  self- 
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interstitial  mechanism  implies  there  will  be  many 
orders  o f magnitude  less  void.  As  was  noted  in 
Sec.  n,  the  presence  of  small  (or  order  6 nm) 
voids  is  difficult  to  detect1®  by  TEM.  One  could 
not  distinguish  the  difference  in  absorbing  power 
from  surface  effects  and  noise.  The  scattering 
intensity  from  vacancies  and  interstitials  is  the 
same  in  the  absence  of  lattice  distortions.  In 
either  case  there  would  be  lattice  distortion  due 
to  impurities  because  both  types  of  dislocation 
would  tend  to  attract  impurities®3  by  their  strain 
fields.  (In  fact,  the  dislocations  are  almost  always 
observed  to  be  decorated  with  impurities.®3)  How- 
ever, the  presence  of  such  voids  should  be  detected 
if  one  diffuses  a radioactive  gas,  such  as  3H  or 
3TAr  or  41  Ar  or  3*Ar,  through  the  sample®4  and  mea- 
sures the  radioactivity  remaining  in  the  sample. 

As  these  gases  diffuse  rapidly  and  inter stitially , 
there  would  be  a negligible  concentration  of  these 
on  substitutional  lattice  sites,  but  they  would  tend 
to  be  trapped  within  any  voids  in  the  sample. 

Other  interstitially  diffusing  impurities,  such 
as  Cu  or  Li,  would  also  be  expected  to  precipi- 
tate into  any  cavities  that  are  present  in  the  inter- 
ior of  the  loop  as  well  as  around  the  perimeter. 
With  the  self- interstitial  mechanism,  one  would 
expect  to  find  such  precipitates  only  about  the  core 
-of  the  dislocation  and  not  in  the  body  of  the  loop, 
which  should  be  perfect  material  according  to  that 
model.  Therefore,  by  determining  the  number  of 
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atoms  of  Li  or  of  Cu  which  precipitate  at  low  tem- 
peratures on  dislocation  loops  that  were  previously 
grown  into  the  sample,  such  as  the  swirl  defects®3 
in  Si,  and  comparing  this  to  the  number  of  sites 
on  the  perimeter  of  the  loops  and  to  the  number  of 
sites  within  the  loops,  one  should  be  able  to  deter- 
mine if  the  impurity  is  restricted  to  the  dislocation 
core.  It  may  also  be  possible  to  detect  the  pres- 
ence of  such  impurities  in  the  body  of  the  loop  by 
high- resolution  electron  energy-loss  analysis** 
or  by  x-ray  microanalysis.*7 

It  should  be  noted  that  the  absence  of  Moire 
fringes  in  the  body  of  many  of  these  loops  is  not 
conclusive  evidence  that  the  material  in  the  loop 
is  perfect.  It  may  just  as  well  be  that  the  mater- 
ial is  highly  disordered,  essentially  amorphous 
in  that  region,  or  that  it  consists  of  pillars  of 
perfect  material  dispersed  between  regions  of 
void  or  of  disordered  material. 
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Threshold  for  Optically  Induced  Dislocation  Glide  in  GaAs-AlGaAs  Doable 
Heterostnictures:  Degradation  via  a New  Cooperative  Phenomenon? 

B.  Monemar, w R.  M.  Potemski,  M.  B.  Small,  J.  A.  Van  Vechten.  and  G.  R.  Woolhousew 
IBM  Thomas  J.  Watson  Rastarek  Cantor.  Torktown  Bright s,  Now  York  10598 
(Received  5 October  1977) 


We  have  observed  a sharp  threshold  for  the  process  at  optically  Induced  slide  at  which 
the  velocity  changes  by  more  than  a factor  of  103  when  the  excitation  Intensity  chafes 
only  20%.  This  threshold  la  Insensitive  to  doping  and  to  the  presence  of  a p-n  Junction. 
The  effect  is  shown  not  to  be  related  to  recombination- enhanced  motion  or  to  local  heat- 
ing. An  explanation  hi  terms  at  the  reduction  of  frictional  forces  by  Interaction  with  un- 
recombined carriers  la  offered. 


It  has  been  reported  previously3*3  that  optical 
excitations  of  intensity  in  excess  of  10s  W cm*3 
can  lead  to  rapid  (SO  j*m  s*‘)  dislocation  glide  in 
AlGaAs  double -heterostructure  lasers.  The  ob- 
servations were  made  using  photoluminescence 
topography.  A laser  wafer,  without  the  normal 
capping  layer  of  GoAs,  is  Illuminated  by  the  fo- 
cused 647.1-  and  616. 4- nm  radiation  of  a Kr+  la- 
ser, to  which  the  upper  waveguide  layer  is  trans- 
parent but  the  active  layer  is  not.  The  active 
layer  may  be  observed  by  the  photoluminescence 
produced  in  it  Defects  are  apparent  as  dark  re- 
gions because  of  the  higher  probability  of  nonra- 
diative  recombination  events  in  their  vicinity. 
Thus  the  motion  of  a gliding  dislocation  may  be 
observed  directly  as  a moving  dark  spot  in  the 
photoluminescence  field. 


The  glide  process  does  not  occur  at  disloca- 
tions which  were  present  during  growth,  but 
only  at  fresh  dislocations.  These  are  introduced 
by  gently  scratching  the  surface  of  the  upper 
waveguide  layer.  On  applying  an  optical  excita- 
tion of  sufficient  duration,  a dense  dislocation 
network  develops  within  a few  microns  of  the 
scratch.  Similar  networks  have  been  observed 
in  studies  of  moderate-intensity  10.6-pm-laser 
damage  in  GaAs.3 

A secondary  stage  of  damage  may  now  be  pro- 
duced if  the  optical  excitation  is  sufficiently  in- 
tense. Then  we  find  that  dislocation  loops  will 
glide  away  from  the  damaged  area.  This  results 
in  a portion  of  the  dislocation  threading  from  the 
bottom  waveguide  layer  through  the  active  layer 
and  the  upper  waveguide  layer  to  the  surface. 
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This  new  threading  dislocation  may  be  made  to 
glide  in  a (110)  direction.  As  it  does  so,  it  lays 
down  an  increasing  length  of  misfit  dislocation. 

If  one  assumes  the  scratch  punches  out  intersti- 
tial dislocations,  then  the  extra  half-planes  may 
relieve  the  misfit  between  the  active  layer,  which 
has  a smaller  lattice  constant,  and  the  underly- 
ing waveguide.  Therefore,  misfit  dislocations 
are  expected  to  occupy  this  interface.  Trans- 
mission electron  microscopy  has  been  used  to 
confirm  this  location.4 

Dislocation  glide  in  double -heterostructure 
lasers  has  also  been  reported  under  conditions 
of  electrical  excitatior  and  simultaneous  stress.5** 
The  same  phenomenon  has  been  observed  in  GaP 
light-emitting  diodes.7  In  order  to  explain  this 
remarkably  enhanced  dislocation  velocity  both 
local  heating  effects  and  recombination-enhanced 
motion  have  been  suggested  tentatively.1*2*6  In 
this  Letter  we  report  additional  observations  of 
these  effects.  We  find  a sharp  threshold  of  ex- 
citation intensity,  above  which  the  dislocation 
moves  extremely  rapidly.  The  nature  of  the 
change  about  the  threshold  is  so  great  as  to  im- 
ply a critical  phenomenon  and  to  discount  the 
previously  suggested  mechanisms.  We  provide 
both  a description  of  the  observations  made  on 
dislocation  movement  and  a new  tentative  theory 
which  may  explain  the  sharp  threshold  we  have 
observed.  We  conclude  it  is  likely  that  some 
cooperative  phenomenon  is  active  in  producing 
the  observed  critical  phenomenon. 

The  materials  used  for  the  present  study  con- 
sisted of  three  layers  grown  on  GaAs(lOO)  sub- 
strates by  liquid-phase  epitaxy.  These  layers 
were  the  lower  waveguide,  the  active  layer,  and 
the  upper  waveguide.  The  normal  GaAs  capping 
layer  was  omitted.  Details  of  both  the  A1  con- 
tents of  the  three  layers  and  the  doping  of  the 
layers  are  given  in  Table  I. 

The  exciting  radiation  was  focused  to  a spot 
whose  diameter  was  30  pm.  The  intensities  used 
in  this  investigation  are  within  the  range  of  those 


at  which  semiconductor  lasers  are  operated. 

(The  absolute  limit  of  optical  flux  which  may  be 
obtained  from  such  a laser  is  that  which  causes 
catastrophic  facet  damage.  This  occurs  when  the 
emergent  flux  is  greater  than**9  3 x 10*  W/cm2. 

The  internal  flux  is  greater  than  this. ) It  is  also 
worth  noting  that  the  samples  were  not  lasing. 
Although  the  local  flux  was  above  threshold,  only 
a small  fraction  the  total  cavity  was  being  pumped 

As  dislocations  were  seen  to  move  in  the  ir- 
radiated region,  the  sample  was  tracked  to  keep 
the  moving  point  central  in  the  field. 

The  nature  of  the  movement  was  complicated. 

At  very  high  excitation  levels,  -3  x 10s  W cm*2, 
the  dislocation  glides  with  a fairly  uniform  rate 
across  the  excited  area.  As  the  intensity  is  re- 
duced a threshold  region  is  reached  where  the 
dislocation  velocity  diminishes  by  at  least  three 
orders  of  magnitude  for  a 20%  change  in  the  ex- 
citing intensity.  The  motion  of  the  dislocations 
also  becomes  irregular  in  this  region.  When  the 
intensity  is  below  threshold,  the  speed  is  60 
nm/s  or  less  and  is  in  a (100)  rather  than  a 
(110)  direction.  This  behavior  of  velocity  as  a 
function  of  intensity,  as  intensity  is  reduced,  is 
shown  in  Fig.  1. 

The  growth  in  the  ( 100)  direction  at  low  inten- 
sity has  been  investigated  by  transmission  elec- 
tron microscopy.10  A complex  structure  is  pro- 
duced as  a result  of  dislocation  climb.  Such  struc- 
tures are  evidenced  as  dark  line  defects.  Once 
the  dislocation  has  grown  a few  microns  by  the 
climb  mechanism,  it  will  not  glide  again. 

It  has  been  proposed  that  the  glide  mechanism 
is  thermally  activated.*  If  this  were  the  case, 
the  dislocation  velocity,  v,  would  be  given  by 

vbBTeaM,  (1) 

where  v is  an  attempt  frequency,  b the  Burgers 
vector  of  a dislocation,  V the  activation  energy, 
r the  local  stress,  m an  exponent  in  the  range11 
from  1 to  2,  and  £ Is  a constant.  It  has  been 
estimated12* 19  that  the  temperature  rise  under 


TABLE  L Sample  parameters  (layers  numbered  from  substrate)  and  measured  threshold  intensities,  / t . 


Wafer 

It 

(10*  W/cm2) 

Percentage  A1 
(layers  1/2/3) 

Carrier  type,  number/cm1  (dopant) 

Width 

C pm) 

A 

0.8 

32/1/32 

In,  10‘5(Sn)l/ln,  10l*(Te)l/ln , 10“(Sn)l 

3.1/0.7/0.9 

B 

1.4 

41/1/41 

[n,  10n(Te)l/l*,  101T(Sn)l/tfi,  10u(Sn)l 

5.0/0. 7/1.5 

C 

1.6 

40/1/40 

(n,  10nCre)l/ln,  10“<—  )J/lp,  10,T(Oe)l 

4.5/0.5/0.8 

D 

2.0 

40/1/40 

(n,  10IT(Te)]/lp,  10M«3e)]/(p,  10,T(Oe)l 

3.6/0.6/0.6 
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FIO.  1.  Semilog  plot  of  velocity  of  photolumlneaence 
spot  due  to  threading  dislocation  versus  external  in- 
tensity of  647.1-  and  676.4-nm  laser  light.  External 
intensity  is  controlled  to  about  10%;  Internal  intensity 
may  vary  more  due  to  variations  in  sample  surface. 

At  the  highest  intensities,  velocity  uncertainty  be- 
comes as  large  as  indicated.  Just  below  threshold  the 
scatter  in  data  is  large,  but  it  la  difficult  to  separate 
any  glide  component  of  the  predominant  climb  motion. 
Only  the  highest  velocity  values  observed  for  several 
dislocations  are  shown  below  threshold  whereas  several 
measurements  are  shown  for  each  Intensity  in  the 
threshold  region. 


these  experimental  conditions  may  be  as  great 
as  200°C.  Three  scratched  samples  were  heated 
to  temperatures  of  2S0,  400,  and  500°C,  respec- 
tively, for  periods  of  1 min.  No  dislocation 
glide  was  observed  to  have  taken  place.  Upon 
optical  excitation,  the  samples  behaved  exactly 
like  unannealed  samples.  In  Eq.  (1)  r would  not 
be  expected  to  be  more  than  linear  in  the  inten- 
sity i,  so  that  one  would  not  expect  the  preex- 
ponential to  be  more  than  quadratic  in  J.  Note 
that  the  effect  of  the  nonlocal  heating  due  to  the 
absorption  of  the  light  only  in  the  GaAs  is  to 
reduce  the  misfit  stress  that  drives  the  glide. 
This  is  because  the  GaAs  has  the  smaller  lattice 
constant  and  expands  as  it  is  heated.  A quantita- 
tive analysis  of  the  nonlocal  heating  shows  that 
it  reduces  the  misfit  stress  less  than  10%  in  our 


experiment. 

A further  characteristic  of  thermally  activated 
glide  is  that  it  is  strongly  dependent  upon  the 
doping  of  the  crystal.14  Therefore,  the  different 
compositions  of  the  structures  listed  in  Table  I 
ought  to  be  expected  to  show  large  differences  in 
threshold  levels  or  velocities.  Neither  is  ob- 
served. 

The  other  previously  proposed  mechanism  for 
glide  is  that  energy  from  recombining  electron- 
hole  pairs1’  enhances  the  motion  of  the  disloca- 
tion. *•*  In  this  case,  one  may  write 

vi Me'^rAJe^-^),  (2) 

where  A is  a constant  and  R is  the  energy  donated 
by  a recombination  event.  According  to  this  hy- 
pothesis v would  simply  be  linear  in  L 

We  will  now  present  a tentative  explanation  for 
optically  induced  glide  which  we  find  consistent 
with  our  observations.  It  is  proposed  that  the 
threading  dislocation  glides  locally  when  the  in- 
stantaneous concentration,  N,  of  unrecombined 
electron-hole  pairs  in  its  vicinity  fluctuates  from 
its  mean  value,  M,  to  such  a high  value,  Nt,  that 
the  frictional  forces  of  the  lattice  are  weakened 
below  the  level  required  to  prevent  the  disloca- 
tion from  gliding  under  its  resolved  stress.  We 
recall14*21  that  the  excitation  of  an  electron  from 
the  bonding  states  near  the  top  of  the  valence  band 
to  the  antibonding  states  of  the  conduction  band 
severely  reduces  the  frequency  of  transverse 
acoustic  lattice  modes  and  the  shear  strength. 
When  Nc» M,  the  probability  that  an  instanta- 
neous fluctuation  in  N will  drive  it  above  Nt  is  a 
very  rapidly  increasing  function  of  M,  and  thus 
of  4 for  virtually  all  plausible  models  of  the 
statistical  distribution. 

We  illustrate  our  proposal  with  an  admittedly 
oversimplified  model  in  which  the  number,  n,  of 
electron-hole  pairs  in  a critical  volume,  V,  about 
the  dislocation  obeys  the  Poisson  distribution, 

PW-  «"#—/«!,  (3) 

where  m is  the  mean  number  of  pairs,  and  as- 
sume 

v*vti£lp(n)e~u<',‘T,T)/tT,  (4) 

where  the  activation  energy,  U,  is  now  taken  to 
be  a function  of  n,  of  temperature,  and  of  strain. 
Note  that  even  at  the  highest  velocities  the  dis- 
location jumps  only  very  infrequently  (its  velocity 
is  much  less  than  the  speed  of  sound).  Taking  v 
-5xl0u/sec  and  6* 0.4  nm,  one  would  have  vb 
* 2.0  x10s  m/sec,  a factor  of  10T  faster  than  the 
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highest  velocity  observed  above  threshold  and  at 
least  IQ10  faster  than  below.  With  a 1-ns  lifetime, 
At  is  of  order  101*  cm"3  in  the  bulk  of  the  active 
layer  at  threshold.  Because  of  recombination  at 
the  dislocation,  let  us  suppose  it  is  5 x 10U  in  the 
region  V about  the  dislocation.  We  may  also  sup- 
pose V has  a radius  of  1.0  nm  and  a length  of  1.0 
/im  so  that  its  volume  is  3 xl0"“  cm3  and  there- 
fore m - 15.  If  we  further  suppose  that  the  criti- 
cal number  » = 45,  the  intrinsic  value  at  the  maxi- 
mal melting  point,  we  find  P(45)  = 2. 1 x 10"  w. 

Now  if  the  intensity  and  thus  m is  increased  by 
20%  to  18,  we  find  that  P(45)  * 7.9  xlO"7,  an  in- 
crease of  more  than  three  orders  of  magnitude. 
While  this  choice  of  suppositions  serves  to  pro- 
duce from  Eq.  (4)  a predicted  onset  of  the  glide 
consistent  with  the  lower  bound  for  the  sharpness 
of  the  observed  threshold,  it  is  obvious  that  a 
number  of  other  choices  of  approximations  and 
parameters  would  also  serve  the  purpose.  We 
suspect  that  the  threshold  is  actually  much  sharp- 
er than  this  lower  bound.  (Observation  of  lower 
rates  of  glide  is  obscured  by  the  dislocation 
climb.)  The  threshold  would  be  sharper  if  a co- 
operative phenomenon  among  the  unrecombined 
carriers  operates  to  cause  their  concentration 
to  increase  more  than  linearly  with  I near  the 
critical  intensity.  Possible  cooperative  pheno- 
mena might  include  a saturation  of  the  recom- 
bination channels  about  the  dislocation  or  a po- 
larization of  the  dislocation  which  might  attract 
carriers  from  the  bulk  or  strain  or  field  grad- 
ients produced  by  the  carriers  that  might  sepa- 
rate electrons  and  holes  spatially  to  prevent 
their  recombination. 

The  difficulty  in  causing  the  dislocation  to  glide 
once  it  has  climbed  may  be  explained  either  by 
the  reduction  in  carrier  lifetime,  and  thus  in  M 
and  m,  in  the  vicinity  of  a climb  network  due  to 
recombination  at  it,  or  by  a pinning  effect  of  the 
climb  network  upon  the  threading  dislocation. 

The  inability  to  glide  grown-in  dislocations  may 
similarly  be  explained  by  the  recombination  due 
to  defects  which  congregate  about  dislocations  at 
growth  temperatures,  or  by  a difference  in  the 
geometries  of  the  two  dislocations.  We  note  that 
the  maximum  n-  or  p-type  doping  in  our  samples 
was  about  1 xl0“  cm'3,  which  is  a factor  of  15 
less  than  our  assumed  value  for  Nt,  so  that  the 
observed  insensitivity  of  threshold  to  doping 
seems  also  to  be  consistent.  Moreover,  the  in- 
trinsic carrier  concentration  reaches  Ne  only  at 
the  congruent  melting  point  of  the  material;  so 
the  lack  of  thermally  activated  migration  at 


500°C  is  also  consistent. 

There  remain  three  questions:  (a)  What  limits 
the  velocity  of  the  dislocation  above  the  thresh- 
old; (b)  why  does  the  portion  threading  through 
the  top  layer,  where  it  does  not  relieve  any  mis- 
fit stress,  glide  along  with  the  portion  at  the  low- 
er GaAs-alloy  interface;  and  (c)  is  there  any  re- 
lation between  this  phenomenon  and  the  increased 
plasticity  found  when  metals  enter  the  supercon- 
ducting state22? 
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C-aAs  is  different  from  Si* 
by 

J.  A.  Van  Vechten 

IBM  Thomas  J.  Watson  Research  Center 
Yorktown  Heights,  New  York  100598 

Defect  thermochemistry  in  zinc-blende  III-V’s  is  different  from,  and  more  complicated 
than,  that  in  Si  or  Ge  because  the  two  sublattices  of  the  III-V  are  inequivalent.  Thus  a given 
impurity  may  occur  on  either  the  anion  or  cation  sublattice  and  may  transfer  from  one  to  the 
other  with  atomic  diffusion.  Moreover,  simple  anion  vacancies,  for  example,  convert  to  cation 
vacancies  plus  cation-on-anion-site  antisite  defects  each  time  the  vacancy  migrates  to  a nearest 
neighbor  site.  In  this  paper  two  effects  resulting  from  this  complication  will  be  treated.  These 
are  the  variation  in  the  rate  of  anion  and  cation  atomic  diffusion  with  the  stoichiometry  of  the 
host  material  and  the  site  preference  and  compensation  ratio  of  Group  IV  impurities  (C,  Si, 
Ge,  and  Sn)  in  III-V’s.  The  first  problem  has  been  highlighted  by  the  recent  observations  of 
Small  and  Ghez  that  Al  diffusion  in  GaAIAs  is  about  10s  times  faster  under  LPE  conditions 
than  the  rate  measured  for  bulk  MBE  material  by  Chang,  el  al.  and  by  Dingle,  ct  al.  This 
discrepancy  is  in  accord  with  the  author’s  previous  estimates  of  the  equilibrium  concentrations 
of  anion  and  cation  vacancies  for  the  appropriate  growth  conditions  (LPE  is  done  on  the  metal 
rich  side  and  MBE  on  the  As  rich  side  of  the  existence  curve)  and  the  conclusion  that 
self-dif fusion  occurs  by  nearest  neighbor  vacancy  migration  in  GaAs  and  most  other  Ill-V's. 
Thus  the  rate  of  Al  diffusion  is  proportional  to  the  As  vacancy  concentration.  The  donor  or 
acceptor  character  of  Group  IV  impurities  is  governed  by  two  factors.  The  relative  concentra- 
tions of  anion  and  cation  vacancies  during  growth  implies  more  p-type  behavior  in  LPE 
material  than  in  MBE  or  VPE  material.  The  second  factor  is  the  site  preference  of  a given 
impurity  independent  of  stoichiometry.  Two  factors  contribute  to  this  site  preference  -the 
relative  size  of  the  impurity  and  host  atoms  and  the  electronic  structure  of  the  impurity’s  core. 
It  is  argued  that  large  size  mismatches  of  either  sign  are  more  easily  accommodated  on  an 
anion  site  than  on  a cation  site,  so  that  both  C and  Sn  are  acceptors  in  GaAs.  However.  Si. 
which  is  nearly  the  same  size  as  Ga  or  As,  prefers  the  cation  site  due  to  its  lack  of  a filled 
d-core  and  is  usually  a donor.  Ge  has  little  preference  because  its  size  and  core  are  nearly  the 
same  as  those  of  Ga  and  As.  It  is  a donor  or  acceptor  according  to  stoichiometry. 

* This  work  was  supported  in  part  by  the  U.  S.  Air  Force  Office  of  Scientific  Research  under 
contract  No.  F 49620-77-C-0005. 
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Self-Consistent  Method  for  Point  Defects  in  Semiconductors:  Application 

to  the  Vacancy  in  Silicon 

J.  Bernbolc,  Nunxio  O.  Liparl,  and  Sokrates  T.  Pantelidaa 

IBM  Thomas  J.  Watson  Rsssarck  Cantor,  York  toon  Brights,  Now  York  10598 
(Receive d 30  June  1978) 

We  report  the  develop  meat  at  i method  to  oelonlete  eetf-oooelateatly  the  electronic 
etraotnre  at  neutral  point  defects  in  semiconductors.  The  method  is  an  adaptation  of  the 
original  Koeter-Slater  idea.  Cslowlations  become  feasible,  practical,  and  accurate  at  the 
level  of  current  band- structure  and  surface  calculations  when  an  LCAO  basis  sat  is  used 
instead  of  Wannier  functions.  A detailed  aMf  °f  the  isolated  vacancy  is  Si  la  used  to 
illustrate  the  method. 


Point  (Infects  (vacancies,  interstitials,  etc.) 
and  impurities  in  semiconductors  are  known  to  in- 
troduce localised  states  with  energy  levels  in  the 
fundamental  gap.  While  shallow  levels  and  some 
moderately  deep  levels  are  adequately  described 
by  effective-mass  theory,1  the  theoretical  de- 


scription of  most  deep  levels,  which  play  a domi- 
nant role  in  determining  many  properties  of  elec- 
tronic devices,  has  been  one  of  the  major  out- 
standing problems  of  semiconductor  physics.  A 
large  number  of  methods  have  been  introduced 
and  used  for  a variety  of  defects  and  impurities.2 
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The  results  have  provided  substantial  quali' 
understanding  of  the  nature  of  binding  at  deep 
levels,  but  no  approach  has  established  itself  as 
capable  of  providing  accurate  and  unambiguous 
solutions , comparable  in  accuracy  and  reliability 
to  the  calculations  that  are  currently  possible 
for  perfect  bulk  solids  (band  theory) , surfaces, 
and  interfaces.  In  this  Letter,  we  report  the  de- 
velopment of  a method  which  is  capable  of  pro- 
ducing results  of  precisely  such  accuracy  and  re- 
liability and  illustrate  its  power  with  a detailed 
description  of  an  isolated  neutral  unreconstructed 
vacancy  in  silicon. 

The  problem  at  hand  has  two  distinct  aspects: 
the  choice  of  Hamiltonian  and  that  of  a method  to 
seek  the  corresponding  eigensolutions.  Even 
though  a variety  of  Hamiltonian  choices  (semi- 
empirical,  superposition  of  atomic  potentials, 
etc.)  can  provide  useful  information,  we  have 
elected  to  use  our  method  in  the  context  of  a self- 
consistent  local-density  theory  of  electronic 
structure.  This  choice  frees  our  results  from 
any  dependence  on  the  assumed  similarity  of  in- 
teractions in  the  perturbed  system  to  those  in 
the  unperturbed  bulk  crystal.  We  show  below 
that  some  of  the  results  we  obtained  previously 
using  a semiempirical  tight-binding  Hamiltonian 
in  fact  survive  the  iteration  to  self-consistency, 
but  others  do  not. 

Given  the  Hamiltonian,  we  now  turn  to  the  choice 
of  method.  The  most  common  method,  namely 
the  cluster  method,  has  recently3  been  used  with 
self-consistent  Hamiltonians.  The  conclusion 
from  these  studies  was  that  even  54-atom  clus- 
ters, the  largest  that  could  be  handled,  are  not 
adequate  to  contain  the  bound- state  wave  functions 
and  produce  a large  uncertainty.3  More  recently, 
two  of  utf4  demonstrated  that,  when  a semiempir- 
ical Hamiltonian  is  used,  the  most  powerful  and 
accurate  method  available  to  solve  the  problem 
of  neutral  point  defects  is  an  adaptation  of  the 
Kcster-Slater5  method,  first  introduced  in  1954 
and  subsequently  used  in  several  studies  of  point 
defects?’7  and  surfaces.*  Our  method  has  three 
clear-cut  advantages  over  the  cluster  approach. 
First,  our  approach  permits  us  to  focus  directly 
on  changes  in  the  electronic  structure  caused  by 
the  defect;  related  to  this  point  is  the  fact  that 
properties  of  the  bulk,  such  as  the  band  gap,  are 
accurately  built  in  to  our  calculations  and  are 
not  affected  by  introduction  of  the  defect.  This 
is  not  true  of  even  the  largest  clusters  (54  atoms3) 
which  have  been  studied  self-consistently.  Sec- 
ond, our  formalism  fully  exploits  the  translation- 


al symmetry  of  the  host  material.  The  third  im- 
portant advantage  of  this  approach  is  that  the 
magnitude  of  the  required  numerical  problem  s 
governed  by  the  spatial  range  of  the  perturbation 
potential  rather  than  the  very  much  greater  range 
over  which  the  electronic  wave  function  is  altered. 
In  the  particular  application  discussed  below,  for 
example,  a full  30o  of  the  charge  associated  with 
the  localized  state  in  the  band  gap  lies  outside  of 
the  region  in  which  the  perturbation  potential  is 
localized. 

A description  of  the  Koster-Slater  method  is 
available  in  the  original  paper  and  in  a series  of 
papers  by  Callaway  and  Hughes6  and  Callaway 
The  most  convenient  formulation  of  the  method 
is  in  terms  of  Green’s  functions.  If  G°(£)  is  the 
Green’s-function  operator  for  the  perfect  crystal, 
bound  states  in  the  gap  introduced  by  a given  per- 
turbation U are  given  by  the  zeros  of  the  determi- 
nant 

Z>(£)  =detll  1 -G°(E)f/  !|.  (1) 

The  change  in  the  density  of  states  in  the  band 
continua  is  also  given  by  an  expression  involving 
only  £(£■).  More  importantly,  the  change  in  the 
charge  density,  which  is  needed  for  self-consist- 
ency, is  given  by 

Ap  = | /occdMl  - U - G°(E)C,ri}G°(E).  (2) 

For  analytical  as  well  as  numerical  work,  the 
operators  appearing  in  the  above  formulas  must 
be  represented  in  a convenient  basis  set.  Tradi- 
tionally, operators  have  been  expressed  in  terms 
of  Wannier  functions  as  the  natural  basis  set  of 
localized  functions.  The  first  applications  of  the 
Koster-Slater  approach,*  however,  proved  ex- 
tremely cumbersome,  not  for  reasons  related  to 
the  method  per  se , but  for  reasons  related  to  the 
construction  of  the  Wannier  functions.  The  final 
results  had  to  be  empirically  adjusted  even  to 
obtain  a bound  state  in  the  gap.  We  avoid  these 
serious  difficulties  by  starting  with  the  linear- 
combir.ati  on  - of  - ato  mi  c - orbi  tal  s (LCAO)  method 
for  a self-consistent  pseudopotential  band-struc- 
ture calculation.10  We  then  use  the  same  set  of 
LCAO  orbitals  as  basis  states  for  all  the  oper- 
ators of  the  Koster-Slater-Green’s-function 
formalism.11  While  simple  LCAO  orbitals  are 
not  conveniently  orthonormal,  they  are  enor- 
mously easier  to  generate  and  the  relatively  small 
number  of  functions  actually  required  to  describe 
the  defect- induced  change  in  the  electron  density 
can  be  easily  orthonormalized.  Physically,  the 
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implicit  assumption  is  made  that  as  long  as  the 
chosen  set  of  LCAO  orbitals  on  each  atom  of  the 
perfect  crystal  is  adequate  to  give  an  accurate 
Land  structure,  then  the  same  set  of  orbitals  at 
all  rites  is  also  adequate  to  describe  the  infinite 
crystal  containing  a single  vacancy.11  Having 
chosen  a set  of  LCAO  orbitals  on  each  atom,10 
we  exploit  symmetry  and  form  symmetrized  orbi- 
tals on  each  shell  of  atoms  surrounding  the  defect 
site  (shell  orbitals).  Finally,  because  shell  orbit- 
als are  not  orthogonal,  we  construct  orthogonal- 
ized  shell  orbitals  (OSO’s)  by  orthogonalizing 
each  shell  orbital  to  the  orbitals  on  all  the  shells 
closer  to  die  defect  site. 

The  Green ’s-function  matrix  elements  between 
pairs  of  OSO’s  are  then  calculated  using  standard 
techniques  of  Brillouin-zone  integration  for  the 
imaginary  part  and  a Hilbert  transform  for  the 
real  part.  Since  the  number  of  LCAO  orbitals  on 
each  atom  is  finite,  the  Hilbert  transform  is 
uniquely  defined  in  this  case.11  These  quantities 
are  calculated  once  for  a given  host  crystal  and 
stored.  For  a given  U,  one  has  simply  to  calcu- 
late its  matrix  elements  between  pairs  of  OSO’s 
and  construct  the  quantity  D{E)  of  Eq.  (1),  which 
now  becomes 

D(E)=detUat-TJrGar0U-)UrB\\,  (3) 

where  a , 0 , and  y label  the  OSO’s.  The  self-con- 
sistent-field  iteration  is  initiated  with  an  arbi- 
trary estimate  of  U.  The  change  in  the  charge 
density  Ap(r)  is  then  obtained  by  expressing  (2)  in 
the  OSO  representation.  From  this  quantity,  a 
new  perturbation  potential  is  constructed  and  so 
on.  Details  of  the  calculations  will  be  given  else- 
where. 

We  turn  now  to  our  results  for  the  isolated  va- 
cancy in  Si.  The  objectives  of  the  calculations 
are  to  investigate  the  following:  (a)  bound-state 
energies  and  wave  functions,  (b)  changes  in  the 
density  of  states  within  the  band  continua  corre- 
sponding to  resonances  and  antiresonances,  and 
(c)  the  charge  distribution  in  the  vicinity  of  the 
vacancy. 

The  calculations  were  performed  using  LCAO’s 
centered  on  the  first  three  shells  of  neighbors 
surrounding  the  vacancy  (total  of  28  atoms)  and 
on  the  vacancy  site.  From  simple  tight-binding 
arguments*  for  an  unreconstructed  vacancy,  one 
expects  bound  states  and  the  most  important 
changes  in  the  density  of  states  to  occur  in  the 
A,  and  Tt  symmetries.  In  Si,  we  find  only  one 
bound  state  within  the  fundamental  gap;  it  has 
T,  symmetry  (threefold  degenerate)  and  lies  at 


0.8  eV  above  valence  bands,  which  is  our  refer- 
ence energy.  Since  the  crystal  is  neutral,  this 
state  contains  only  two  electrons.  The  changes 
in  the  density  of  states  for  Al  and  Tt  symmetries 
are  shown  in  Fig.  1.  From  the  analytic  proper- 
ties of  D(JE)  [Eq.  (1)],  it  can  easily  be  shown  that 
the  total  change  in  the  density  of  states  in  the 
valence  bands,  for  each  symmetry,  must  inte- 
grate to  an  integer.  In  the  case  of  T,  states,  the 
change  is  mostly  negative  and  the  integral  is  - 6, 
thereby  compensating  the  T2  bound  state  in  the 
gap.  In  the  case  of  A,  states,  a sharp  resonance 
is  present  at  - 0.6  eV.  This  resonance  lies  in  an 
energy  region  where  the  local  density  of  states 
of  Aj  symmetry  is  very  small  so  that  the  reso- 
nance is  effectively  a bound  state.  A second  reso- 
nance is  present  at  - 8.1  eV.  Antiresonances 
compensate  so  that  the  total  integral  is  zero. 

The  contour  plot  in  Fig.  2(a)  shows  that  the  total 
change  in  die  charge  density  (ard  therefore  also 
the  corresponding  potential)  is  localized  almost 
entirely  within  the  cavity  defined  by  the  nearest 
neighbors.  Correspondingly,  in  our  basis  (which 
includes  LCAO  on  the  first  three  shells)  the  total 
change  in  the  charge  density  integrates  to  its  full 
value  -4.0  (as  we  have  removed  an  atom  with 


FIG.  1.  The  density  of  states  and  the  change  In  the 
density  of  states  of  A,  and  T2  symmetries.  The  curves 
are  broadened  by  0.2  eV  and  the  reference  energy  Is 
the  top  of  the  valence  bands. 
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FIG.  2.  Contours  of  constant  electron  density  near 
the  vacancy  (in  atomic  units)  for  (a)  the  total  change  in 
Uw  charge  density  Negative  contours);  (b)  the  T2  bound 
state  (see  text);  (c)  the  total  charge  density. 

four  valence  electrons).  Since  the  self-consistent 
procedure  requires  knowledge  of  the  potential 
alone,  this  implies  the  adequacy  of  our  basis  for 
this  purpose.  It  is  interesting  to  note  that  the  in- 
dividual states  of  the  vacancy  are  quite  extended. 
In  fact,  our  basis  contains  only  7Cflo  of  die  bound- 
state  wave  function  (i.e.,  1.4  electrons  instead  of 
2).  A contour  plot  of  the  contributions  of  the  first 
three  shells  to  the  Tt  bound  state  is  shown  in 
Fig.  2(b).  Similarly,  the  change  in  the  density 
of  states  in  the  valence  bands  integrates  to  - 5.2 
electrons  for  die  Ta  symmetry  (instead  of  - 6) , 
and  to  -0.1  for  A,  symmetry  (instead  of  0). 

This  indicates  that  the  extented  bound  state,  reso- 
nances, and  antiresonances  screen  each  other  in 
a rather  intriguing  way,  producing  finally  a very 
localized  potential,  which  in  turn  can  easily  be 
expanded  in  a localized  basis.  It  also  clearly 
demonstrates  die  advantage  of  the  present  formal- 
ism. A cluster , which  would  adequately  repre- 


sent this  infinite  system,  would  need  to  contaiu 
fully  these  extended  states  individually. 

The  total  charge  density  around  the  vacancy  is 
shown  in  Fig.  2(c).  We  observe  that  all  the  bonds 
in  the  crystal  remain  virtually  unchanged,  and 
the  dangling  bonds  remain  pointing  toward  the 
missing  atom.  The  contour  plots  reveal  that  the 
dangling  bonds  have  a rather  constant  charge  den- 
sity near  the  atoms  and  extend  appreciably  into 
the  vacant  site.  The  bound-state  orbital  [Fig. 

2(b)],  although  considerably  delocalized,  has  a 
dangling-bond  character. 

Comparison  of  the  present  self-consistent  re- 
sults with  L-use  obtained  earlier4  using  the  semi- 
empirical  tight-binding  method  (SETBM)  reveals 
the  following:  (1)  The  state-density  changes  are 
strikingly  similar.  (2)  The  position  of  the  bound 
state  in  the  gap  (0.8  eV  versus  0.3  eV  found  ear- 
lier4) is  appreciably  different  and  probably  re- 
flects not  only  self-consistency,  but  mainly  the 
effect  of  the  more  realistically  broad  conduction 
bands  of  the  present  work.  (3)  The  electron-den- 
sity maps  provided  by  the  present  work  confirm 
the  basic  assumption  of  the  SETBM  study,  namely 
that  the  Si-Si  bonds  near  the  vacancy  are  not 
significantly  affected.  The  present  work  also  in- 
dicates that  the  simple  “removal”  model  used  in 
Ref.  4 to  describe  the  vacancy  is  fundamentally 
correct. 

Comparison  with  experiment  is  not  yet  possible 
because  the  vacancy  in  Si  is  known13  to  induce  a 
significant  lattice  reconstruction,  an  effect  which 
has  not  been  included  in  the  present  calculations. 
The  situation  is  comparable  to  free  surfaces, 
where  the  determination  of  reconstruction  is  still 
an  unsolved  problem.  However,  the  bound  state 
we  find  at  0.8  eV  is  consistent  with  experimental 
observation13  according  to  which  the  level  for  the 
neutral  vacancy  is  near  the  bottom  of  the  gap  af- 
ter Jahn-Teller  stabilization  by  about  1 eV.13 

In  conclusion,  we  feel  that  the  present  work 
goes  a long  way  towards  bringing  the  isolated-de- 
fect problem  to  the  level  of  sophistication  which 
characterizes  contemporary  studies  of  crystal 
surfaces;  we  hope  that  theory  will  soon  play  an 
important  role  in  analyzing  the  wealth  of  experi- 
mental data  which  already  exists  in  this  context. 

It  is  a pleasure  to  thank  S.  G.  Louie  and  A.  R. 
Williams  for  valuable  discussions.  This  work  is 
supported  in  part  by  the  U.  S.  Air  Force  Office 
of  Scientific  Research  under  Contract  No.  F49620- 
77-C-0005.  

*W.  Kohn,  in  Solid  State  Physics,  edited  by  H.  Ehren- 
relch,  F.  Seitz,  and  D.  Turnbull  (Academic,  New  York, 


898 


it 


Mi 


— 


Volume  41,  NUMBER  13 


PHYSICAL  REVIEW  LETTERS 


25  Sepiember  1978 


1957),  Vol.  5,  p,  257;  S.  T.  Pantelldes,  Festkorperpro- 
bleme  15,  149  (1975). 

;For  a recent  critical  review  of  the  field,  see  S.  T. 
Pantelldes,  to  be  published. 

3S.  G.  Louie,  M.  SchlUter,  J.  R.  Chelikowsky,  and 
M.  L.  Cohen,  Phys.  Rev.  BU3,  1634  (1976).  In  this 
work,  periodic  boundary  conditions  were  used  (super- 
lattice of  defects).  The  bound  state  had  a dispersion  of 
1 eV  In  the  1-eV  gap  of  SI.  Self-consistent  calculations 
with  even  smaller  clusters  have  been  reported  by 
B.  Cartllng,  J.  Phys.  C 8,  3183  (1975)  and  by  L.  A. 
Hemstreet,  Phys.  Rev.  B 15,  834  (1977). 

'J.  Bernholc  and  S.  T.  Pantelldes,  Phys.  Rev.  B 18, 
1780  (1978). 

5G.  F.  Koster  and  J.  C.  Slater,  Phys.  Rev.  95,  1167 
(1954). 

6J.  Callaway  and  A.  J.  Hughes,  Phys.  Rev.  156,  860 
(1967);  and  164,  1043  (1967). 

tM.  Jaros  and  S.  Brand,  Phys.  Rev.  B 14,  4494  (1976). 
The  method  used  In  this  work  Is  a modification  of  the 
Koster-Slater  approach  Introduced  first  by  F.  Bassanl, 
G.  Iadonisi,  and  B.  Preziosi,  Phys.  Rev.  166.  735 


(1969). 

8J.  Koutecky,  Adv.  Chem.  Phys.  9,  85  (1965);  J.  Poll- 
mann  and  S.  T.  Pantelldes,  Phys.  Rev.  B (to  be  pub- 
lished). 

’J.  Callaway,  J.  Math.  Phys.  (N.  Y.)  5,  783  (1964). 

10D.  J.  Chadi,  Phys.  Rev.  B _16,  790  (1977). 

“Ten  orbitals  per  atom  are  used  as  in  Ref.  10.  The 
use  of  the  same  set  of  orbitals  for  the  defect  problem 
means  that  a total  of  twenty  bands  are  used  for  (he  ex- 
pansion of  the  defect  wave  functions.  (It  was  established 
in  Ref.  8 that  ten  bands  are  entirely  adequate  for  con- 
vergence.) Our  choice  of  orbitals  for  the  defect  prob- 
lem also  eliminates  any  ambiguities  associated  with 
Hilbert  transforms  which  are  necessary  for  the  evalua- 
tion of  Green's  functions  (see  also  Ref.  5).  In  the  case 
of  an  Impurity  with  an  electronic  structure  significantly 
different  from  that  of  the  host  atoms,  additional  LCAO 
orbitals  would  have  to  be  Included  on  the  Impurity  atom. 
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Electron  Capture  into  the  Two-Electron  O State  in  GaP 


T.  N.  Morgan 

IBM  Thomcs  J.  Watson  Research  Center , Yorhtoum  Heights,  New  York  10598 
(Received  23  September  1977) 

A reinterpretation  of  the  ir  spectra  of  Dean  and  Henry  in  oxygen-doped  GaP  strongly 
suggests  that  they  involve  radiative  capture  and  excitation  of  the  second  electron  bound  to 
oxygen.  Further,  this  electron  is  localized  near  the  nearest-neighbor  Ga  ions,  it  is 
bound  by  ~0.9  eV  as  found  also  by  Samuelson  and  Monemar,  and  its  electron-phonon  cou- 
pling is  small — A *0.6,  Aw  *47.5  meV — in  disagreement  with  Henry  et  at.  The  analysis 
emphasizes  the  usefulness  of  the  “phonon  signature’’  in  identifying  deep  levels  from  their 
optical  spectra. 


Dean  and  Henry  (DH)  have  published  a careful 
study  of  radiative  electron  capture  onto  deep  oxy- 
gen donors  in  GaP.1  Subsequently  Henry  and  co- 
workers*  ■*  discovered  by  photocapacitance  tech- 
niques that  each  oxygen  donor  in  GaP  could  bind 
a second  electron  to  become  O',  with  each  elec- 
tron bound  by  nearly  1 eV.  Since  this  latter  dis- 
covery both  states  have  been  studied  extensively, 
and  two  conflicting  models  for  the  two-electron 
state  have  developed — that  of  Henry  and  co-work- 
ers**4 and  that  of  Grimmeiss  et  al.*  and  of  Mor- 
gan.® The  former  authors  conclude  that  a very 
strong  electron-lattice  interaction  is  acting  and, 
indeed,  that  GaPK)  is  a prime  example  of  such 
interactions  in  semiconductors.4  The  latter  au- 
thors conclude  that  a kind  of  selection  rule  must 
be  operating,  so  that  optical  transitions  between 
the  conduction -band  edge  and  the  two-electron 

ground  state  of  O'  are  very  weak.  In  this  Letter 
I reexamine  the  DH  data  and  propose  that  they  in- 
volve the  second  electron  in  O*  and  allow  impor- 
tant properties  of  this  anomalous  state  to  be  de- 
termined. 

An  indication  that  the  original  one-electron  in- 
terpretation of  the  DH  data  is  incorrect  is  given 
by  a comparison  of  the  shape  of  the  phonon  rep- 
lica structure  in  the  luminescence  spectrum  of 
DH  with  that  of  the  donor -acceptor  (D-A)  pair 
spectra  of  Dean,  Henry,  and  Frosch,7  both  of 


which  are  shown  in  Fig.  1.  Monemar  and  Samu- 
elson have  recently  found*  that  the  same  phonon 
energies  [Kul=  19  meV  and  JFw2  = 48  meV  (see  al- 
so Ref.  7 )j  and  relatively  large  coupling  strengths 
[x,=  1.65  and  X2  = l.l]  which  fit  the  pair  (emis- 
sion) spectra  also  explain  the  shape  and  tempera- 
ture dependence  of  the  photoneutraiization  (ab- 
sorption) spectra  for  the  0°  ground  state.  [These 
are  referred  to  as  the  configuration-coordinate 
(C-C)  modes.]  These  phonons  do  not  appear  in 
other  D-A  pair  spectra.  Thus,  this  satellite 
structure  may  be  considered  the  " signature”  of 
the  deep  0°  ground  state  with  respect  to  its  one- 
electron  transitions. 

The  spectra  in  DH,  however,  are  very  differ- 
ent— in  part  because  the  zero-phonon  transition 
O0  ( see  Fig.  1)  is  forbidden  and  cooperation  of  a 
nonsymmetrtc  (active)  phonon  is  needed  to  relax 
the  selection  rule  and  produce  a strong  replica, 
and  in  part  because  the  states  are  different.  If 
the  deep  electron  states  involved  in  both  of  these 
spectra  were  the  same,  the  C-C  phonons  and 
coupling  strengths  would  be  nearly  the  same  (be- 
ing only  weakly  dependent  on  the  shallow  state  in- 
volved), and  the  same  replicas  of  any  strong  fea- 
tures which  appear  in  one  spectrum  would  appear 
in  both.  Thus  there  would  appear  in  Fig.  1(a)  a 
strong  broad  19-meV  replica  of  0]oe'  and  also  of 
0TA  and  of  the  optical  phonon  peaks,  none  of 


190 


© 1978  The  American  Physical  Society 


Volume  40,  Number  } 


PHYSICAL  REVIEW'  LETTERS 


16  Januaby  1978 


FIG.  1.  Photoluminescence  from  O -doped  GaP  at  low  temperature:  (a)  capture  spectrum  from  Ref.  1.  showing 
C-C  phonon  ug  and  the  missing  phonon  u,;  (b)  pair  spectrum  from  Ref.  8 showing  the  phonons  and  the  stepwise  de- 
convolution.  Note  the  change  in  energy  scale. 


which  is  present.  Therefore,  the  deep  electron 
states  responsible  for  the  two  spectra  are  not  the 
same ! Since  the  state  involved  in  the  pair  spec  - 
tra  is  known  from  many  independent  measure- 
ments to  be  the  neutral  oxygen  donor,  we  con- 
clude that  the  oxygen  centers  studied  by  DH  are 
in  the  two-electron  O'  state. 

The  conclusion  that  these  spectra  involve  O'  is 
reinforced  by  another  part  of  the  DE  study.  In 
the  n-type  sample  used  for  the  excitation  spec- 
trum shown  in  Fig.  6(a)  of  Ref.  1,  one  expects 
the  two-electron  oxygen  species  to  be  dominant, 
n.»n0»n+,  and  the  sharp-line  spectrum  shown 
in  Fig.  6(a)  to  arise  from  the  transition  O' -(O')* 
of  the  two-electron  oxygen  state.  This  conclusion 
is  supported  by  the  DH  finding  that  this  spectrum 
is  observable  only  in  n-type  crystals  containing 
oxygen,  although  this  argument  is  not  conclusive.9 
Since  these  zero-phonon  transitions  are  optically 
allowed,  strong  phonon  replicas  are  generated 
only  by  A,-type  C-C  modes  to  which  the  state  is 
coupled.  There  are  two  of  these  modes,  B'  and 
C',  identified  by  DH  in  their  Fig.  6(a)  and  listed 
in  their  Table  n.  The  primes  have  been  added  to 
distinguish  them  from  the  non-A  t active  modes 
B and  C found  in  the  electron-capture  spectrum, 
Table  I of  DH. ) The  energies  and  approximate 
coupling  strengths  are  = 46.42 ± 0.07  meV, 
Aa.sO.4,  and  ftuC'  = 49.1  ± 0.1  meV,  Ac»*0.2. 

There  is,  again,  no  indication  of  the  prominant 
19-meV  mode  associated  with  O0,  thus  support- 
ing the  identification  of  this  as  an  O'  spectrum! 
Further,  the  persistent  phonon  in  the  lumines- 
cence spectrum,  Fig.  1 and  Table  I of  DH,  which 
has  an  energy  of  ffu>0=47.53±  0.09  meV  and  a 
strength  of  \0~  0.6,  is  simply  the  unresolved  sum 
of  B'  and  C' . The  agreement  in  energy  and 
strength  of  the  phonons  in  these  luminescence 
and  excitation  spectra  confirms  that  the  deep 


states  involved  are  the  same  in  both. 

The  above  interpretation  of  the  data  provides  a 
simple  resolution  of  several  anomalies  reported 
in  the  DH  paper.  One  of  these  was  the  quenching 
with  increasing  acceptor  concentration  of  the 
electron -capture  luminescence  relative  to  the 
oxygen-acceptor  pair  luminescence.  It  was  ex- 
pected that  the  capture  luminescence  would  be 
proportional  to  the  pair  luminescence,  since  elec- 
trons captured  radiatively  on  the  O donors  should 
then  recombine  with  holes  on  acceptors.  Instead, 
DH  found  that  with  increased  zinc  doping  the  pair 
luminescence  became  dominant  while  the  capture 
luminescence  disappeared.  Somehow,  the  elec- 
trons were  being  captured  on  the  O donors  with- 
out emitting  the  characteristic  spectrum. 

The  resolution  of  this  problem  is  now  clear. 
Electrons  are  captured  with  no  identified  radia- 
tion [possibly  through  a deeper  (O0)*  level]  into 
the  O0  ground  state  from  which  they  recombine 
rapidly  with  bound  holes.  Thus  the  neutral  oxy- 
gen concentration  is  too  low  to  give  an  observable 
rate  of  capture  into  the  O'  state,  which  is  the 
radiative  one. 

Further,  the  absence  of  saturation  of  the  cap- 
ture luminescence  under  strong  excitation,  for 
which  the  D-A  pair  luminescence  saturates,  is 
also  easily  understood.  The  flow  of  electrons 
through  the  radiative  O'  channel — O0  + e'-O' 

+ hv,  followed  by  subsequent  annihilation  of  one 
of  the  bound  electrons  by  a hole — is  independent 
of  the  pair  process  and  is  not  limited  by  the  small 
saturable  transition  probabilities  of  pair  recom- 
bination. 

By  substituting  O18  for  the  normal  O18  isotope 
in  their  samples,  DH  were  able  to  shift  the  0.841- 
eV  emission  line  by  0.67  ± 0.05  meV  to  lower  en- 
ergies— an  amount  equal,  within  experimental 
error,  to  the  corresponding  shift  (0.71  ± 0.02 
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meV)  to  higher  energy  of  the  pair  spectra  report- 
ed by  Dean,  Henry,  and  Frosch  in  Ref.  7.  This 
agreement  suggested  that  the  oxygen  states  ob- 
served in  the  two  experiments  were  the  same. 
This  argument  is  not  conclusive,  however,  be- 
cause a similar  reduction  in  the  local  elastic  con- 
stants for  the  active  oxygen  modes  (and,  hence, 
a similar  isotope  shift)  is  expected  from  the  bind- 
ing of  each  (antibonding)  electron  in  a deep  state. 

It  is  useful  to  construct  models  for  the  C-C 
modes,  since  these  can  give  information  about 
the  localization  of  the  electronic  states.  Briefly, 
one  expects  these  to  be  breathing-type  modes  in- 
volving the  nearer  neighbors,  with  the  lower-en- 
ergy 19-meV  mode  consisting  mainly  of  vibration 
of  the  four  (heavier)  nearest-neighbor  Ga  ions 
and  with  the  48-meV  modes  involving  mainly  the 
twelve  second-nearest-neighbor  P ions.  Thus, 
the  19-meV  mode  is  excited  predominantly  by  a 
change  in  electron  density  in  the  immediate  vi- 
cinity of  the  O impurity  and  couples  to  the  deep 
O0  ground  state,  as  experiment  confirms.  The 
absence  of  this  mode  in  the  O*  spectra  indicates 
that  capture  of  the  second  electron  does  not  alter 
appreciably  the  charge  density  in  the  "central 
cell'’  region  and  that  the  second  electron  is  local- 
ized near  the  four  nearest-neighbor  Ga  ions.  The 
48-meV  modes  appear  in  both  the  0°  and  O*  spec- 
tra, in  agreement  with  this  model  and  with  the 
interpretation  of  these  as  "phosphorus”  modes. 
The  weaker  coupling  to  these  (by  about  half)  for 
the  second  electron  is  consistent  with  a more  ex- 
tended wave  function.  A simple  electrostatic 
model  confirms  that  the  coupling  strengths  are 
all  reasonable. 

The  lowest  excited  two-electron  states  (O’)* 
are  expected  to  contain  one  electron  in  a local- 
ized state — similar  to  Oq — and  one  in  an  extend- 
ed (effective  mass)  multivalley  state.  Hence,  the 
ionization  threshold  of  O'  should  lie  slightly 
above  the  zero-phonon  lines  of  Figs.  1 and  6(a)  of 
DH  at  ~ 0.9  eV.  These  conclusions  agree  well 
with  the  recent  low-temperature  photoionization 
experiments  by  Samuelson  and  Monemar10  in 
which  they  established  that  the  threshold  for  dou- 
ble ionization  of  the  O'  state  is  about  1.8  eV,  or 
~ 0.9  eV  above  the  0°  ionization  threshold,  and 
Out  the  corresponding  phonon  coupling  is  not 
large. 

The  evidence  cited  above  confirms  that  the  ion- 
ization energy  of  the  second  electron  in  O'  is 
about  0.9  eV,  a value  also  consistent  with  the 
thermal  activation  energy  of  0.89  eV  (Ref.  4)  and 
with  optical  values  of  0.8-0. 9 eV  found  by  Ledebo 


and  Ovren.11  These  values  are  in  contradiction, 
however,  with  the  interpretation  of  Grimmeiss 
et  cd.*  (which  assigns  a photoionization  threshold 
at  0.65  eV  to  oxygen)  as  well  as  with  that  of  Hen- 
ry and  co-workers**4  (which  admits  of  no  signifi- 
cant thresholds  below  2.0  eV). 

Henry  and  Lang4  have  pointed  out  in  their  Ap- 
pendix A that  small  concentrations  of  deep  cen- 
ters other  than  oxygen  may  account  for  some  of 
the  thresholds  observed  in  GaP  by  Grimmeiss 
et  al.*  The  evidence  cited  above  appears  to  con- 
firm this  explanation,  at  least  for  the  lowest  0.65- 
eV  threshold.  In  a reinterpretation  of  these  data, 
which  will  be  published  separately,  I note  that 
the  weak  0.86-  and  1.38-eV  thresholds  probably 
correspond  to  transitions  from  O*  into  the  Xl 
and  T,  minima,  while  the  1.77-  and  2.15-eV 
thresholds  involve  two-electron  transitions.4  Of 
the  latter,  the  last  (and  strongest)  agrees  with 
the  only  transition  (£,,  = 2.15  eV  at  300  K)4  recog- 
nized in  Ref.  4 as  belonging  to  O*,  while  the 
first  matches  the  two-electron  structure  of  Ref. 

10. 

It  is  apparent  from  the  above  discussion,  as 
well  as  from  Fig.  11  of  Ref.  4,  that  the  oscillator 
strengths  are  abnormally  small  for  all  O'  photo- 
ionization transitions  within  about  1 eV  of  the 
threshold,  and  that  the  rate  does  not  become  ap- 
preciable at  room  temperature  until  the  “two- 
electron”  transitions  begin  near  1.77  eV.  This 
absence  of  the  low-energy  oscillator  strength  can- 
not be  attributed  to  strong  electron-phonon  coup- 
ling, as  has  been  proposed,2*4  because  of  the 
small  value  of  the  coupling  strength  (*  = 0.6)  es- 
tablished above.  For  the  same  reason,  the  usual 
explanation  of  the  strong  temperature  dependence 
of  the  photocapacitance  data  (and  the  some- 
what weaker  dependence  found  for  oM°)  in  terms 
of  the  electron-lattice  coupling4  is  unacceptable. 

These  facts  lead  to  the  important  conclusion 
that  the  temperature  dependence  of  such  spectra 
is  not,  by  itself,  a reliable  indicator  of  the  C-C 
electron-phonon  coupling  strength  and  that  it 
must,  therefore,  have  another  explanation.  I 
propose  that  the  evidence  strongly  supports  the 
suggestion  of  Refs.  5 and  6 that  it  is  the  small- 
ness of  the  electronic  matrix  element  which  sup- 
presses the  one-electron  transitions  (those  be- 
low about  1.8  eV)  and  that  the  relaxation  of  this 
“ selection  rule”  by  phonon  cooperation  as  the 
temperature  is  raised  gradually  turns  on  the  one- 
electron  transitions  and  accounts  for  the  strong 
temperature  dependence.  Note,  further,  that 
these  properties  are  not  adequately  accounted  for 


Volumi  40,  Numbu  3 


PHYSICAL  REVIEW  LETTERS 


16  Januaxy  1978 


by  the  recent  theories  of  Jaros  and  co-workers. ia 
I shall  consider  possible  origins  of  these  effects 
and  of  the  large  two-electron  binding  energy  in  a 
later  publication. 

The  most  compelling  of  the  many  facts  support- 
ing this  new  interpretation  are  the  anomalous 
phonon  structure  in  the  DH  spectra  and  the  agree- 
ment with  the  two-electron  excitation  spectra  of 
Samuelson  and  Monemar.10  On  the  other  side  one 
could  cite  the  argument  of  Dean*  and  the  similar- 
ities found  by  Carter  et  al.Xi  between  the  excited- 
state  energies  observed  by  DH  and  those  expected 
and  observed  for  shallow  donors  in  GaP.  It  is 
hoped  that  this  Letter  will  stimulate  the  crucial 
experiments  which  will  clarify  the  nature  of  this 
most  peculiar  deep  center. 
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ABSTRACT  It  is  shown  that  the  multivalley  problem  for  donors  in  Si  and  GaP  is  intrinsically  a 
two-band  problem.  One-band  multivalley  equations  for  Ge  and  two-band  multivalley  equa- 
tions for  Si  and  GaP  are  derived  by  evaluating  all  terms  to  the  same  order.  Intervalley 
kinetic-energy  matrix  elements,  for  which  conflicting  accounts  appeared  recently,  are  evaluated 
explicitly  and  shown  to  provide  an  excellent  measure  of  the  validity  of  the  theory.  The 
retention  of  Umklapp  terms  in  potential-energy  matrix  elements,  which  has  recently  been 
demonstrated  to  be  important  for  quantitative  calculations,  gives  rise  to  additional  terms,  not 
considered  previously,  which  are  of  the  same  order  as  other  terms  already  included.  It  also 
leads  to  the  requirement  that  the  impurity  pseudopotential  must  be  internally  consistent  with 
the  host-crystal  pseudopotential.  As  a consequence,  when  an  empirical-pseudopotential 
band-structure  calculation  is  used,  the  screened  point-charge  potential  no  longer  corresponds 
specifically  to  the  isocoric  impurity.  Overall,  the  construction  of  appropriate  impurity  pseudo- 
potentials is  a more  sensitive  task  than  in  earlier  theories  that  neglected  Umklapp  terms. 
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Recently,  a number  of  authors1 2*4  independently  showed 
that  the  conventional  one-band  multivalley  effective-mass 
(EM)  equations  for  impurities  in  semiconductors,  which  have 
been  widely  used5 * * * **10  to  study  donors  in  Si  and  Ge,  have  a 
major  shortcoming:  The  Umklapp  terms  in  the  intervalley 
potential-energy  (PE)  matrix  elements,  which  were  left  out  in 
previous  theories,  are  in  fact  dominant.  Shindo  and  Nara1 
further  suggested  that  intervalley  kinetic-energy  (KE)  matrix 
elements  should  be  completely  absent  and  used  the  resulting 
equation  for  some  preliminary  calculations  with  model  impurity 
potentials  describing  shallow  donors  in  Si.  Altarelli  et  al.3 
adopted  the  same  point  of  view  about  intervalley  KE  terms  and 
carried  out  calculations  using  the  screened  point-charge  poten- 
tial, which  was  taken  to  be  a good  approximation  for  the  im- 
purity potential  of  isocoric  impurities  (Si:P  and  Ge:As),  as 
shown  earlier  in  the  no-Umklapp  theory  of  Ref.  9.  Herbert 
and  Inkson,4  on  the  other  hand,  pointed  out  that  the  interval- 
ley KE  terms  are  not  rigorously  zero  and  suggested  a new 
expression  for  them.  They  made  use  of  model  impurity  poten- 
tials and  presented  results  which  demonstrated  that  the  inter- 
valley KE  terms  can  play  a crucial  role  in  variational  calcula- 
tions. 

In  this  paper  we  report  the  results  of  a systematic  sMidy  of 
multivalley  EM  equations.  In  particular,  we  focus  on  the  fol- 
lowing results: 

1.  We  derive  one-band  multivalley  EM  equations  by  evaluat- 
ing all  matrix  elements  to  the  same  order.  We  show  that, 
when  Umklapp  terms  are  retained,  additional  terms  appear 
in  the  PE  matrix  elements  which  have  not  been  considered 
before,  but  are  of  the  same  order  as  other  terms  already 
included.  Intervalley  KE  matrix  elements  are  not  rigor- 
ously zero,  but  are  negligibly  small  for  shallow  donors  in 
Ge.  Further,  it  is  shown  that  the  approximations  of  the 
theory  break  down  when  the  intervalley  KE  terms  become 
appreciable. 

2.  We  point  out  that  the  multivalley  problem  for  Si  and  GaP 

is  intrinsically  a two-band  problem.  We  derive  the  appro- 

priate two-band  multivalley  EM  equations  and  discuss 

their  solutions.  For  Si,  these  equations  can  be  cast  in  a 

form  which  is  analogous  to  that  of  one-band  multivalley 

equations  with  an  explicit  intervalley  KE  term.  They  can 

be  further  reduced  to  those  used  in  Refs.  1-4  if  additional 

approximations  are  made.  Finally,  the  equations  reduce  to 

the  usual  one-band  one-valley  EM  equation11  when  all 
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interband,  intervalley  and  Umltlapp  terms  are  left  out.  In 
contrast,  for  GaP,  the  two-band  equations  cannot  be  re- 
duced to  simpler  forms. 

3.  We  point  out  that,  when  Umltlapp  terms  are  retained,  the 
impurity  pseudopotential  must  satisfy  a requirement  which 
we  refer  to  as  internal  consistency.  As  a consequence, 
when  an  empirical-pseudopotential  band  structure  is  used, 
the  screened  point-charge  potential  does  not  necessarily 
correspond  to  the  impurity  pseudopotential  of  the  isocoric 
impurity,  but  could  yield  any  binding  energy  within  the 
range  of  shallow-donor  binding  energies.  We  conclude 
that  the  construction  of  impurity  pseudopotentials  and  the 
solution  of  the  equations  are  far  more  demanding  tasks 
than  they  were  in  the  case  of  previous  no-Umklapp 
theories. 6-1 1 

We  have  developed  one-band  multivalley  EM  equations, 
which  are  appropriate  for  Ge,  and  two-band  multivalley  EM 
equations,  which  are  appropriate  for  Si  nnd  GaP.  In  both 
cases,  we  start  with  a one-electron  local-der  sity12  Hamiltonian 
H+U,  where  H is  the  perfect-crystal  Hamiltonian  and  U is  the 
impurity  potential,  and  make  only  two  additional  approxima- 
tions. It  then  becomes  apparent  what  additional  approxima- 
tions and  assumptions  are  necessary  to  reduce  our  results  to 
any  of  the  previous  theories. 

We  begin  by  assuming  that  the  perfect-crystal  band- 
structure  problem 

H*„k(r)«En(k)*nk(r>  (1) 

has  been  solved.  We  then  expand  the  bound-state  wavefunc- 
tion  $(r)  in  the  form 

♦<£>  “ \ Fn<!5>  *nk(£>-  (2) 

Qk 

We  treat  the  Fn(k)  as  variational  functions  and  without  loss  of 
accuracy  or  generality  we  choose  them  to  be  of  the  form 

Fn  00  - 5 K FS  00.  (3) 

fl"  1 

where  F“(k)  is  centered  about  k£,  which  are  a few  selected 
points  in  the  Brillouin  zone  [e.g.  where  En(k)  has  its  extrema]. 
The  coefficients  A“  can  be  determined  by  symmetry  only  when 
the  points  k||  are  equivalent. 

The  first  approximation  we  wish  to  make  in  the  case  of 
donors  is  to  retain  only  the  lowest  band  n-1  and  only  the 
F“(Jk)  that  correspond  to  the  N equivalent  absolute  minima. 
The  criterion  for  the  validity  of  this  assumption  is  easily  estab- 
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lished  by  perturbation  theory  to  be  EB/AE  <<  1,  where  EB  is 
the  binding  energy.  In  the  case  of  subsidiary  minima,  AE  is  the 
energy  separation  between  the  minima,  whereas  in  the  case  of 
other  bands,  AE  is  the  minimum  energy  separation  (direct  or 
indirect)  from  the  absolute  minima. 

In  the  case  of  Ge,  the  above  assumptions  are  valid  for 
shallow  donors  and  we  proceed  to  develop  one-band  multival- 
ley EM  equations.  We  immediately  get  for  the  expectation 
value  of  the  energy 

E[F]  - 2 2 A?  X?  [<H>afl  + <U>“*]  / 2 2 A®  A*  S®*  (4) 
a / J 1 1 a P 1 1 

where 

<H>“*  - 2 F“  (k)*<*lk  I H | *llt>  F?  (It),  (5) 

k ' ^ ***  ***  ^ 

<U>®*  - 2 2 F?  (k)*  <*|k  | U I *,k'>  F*  (k*),  (6) 

k k'  - - 

and 

s«/»  _ 2 F“  (k)*F?(k).  (7) 

We  distinguish  between  intravalley  ( ) and  intervalley 
(a#/3)  terms.  Note  that  in  general13  S“^  #0  for  all  a,  p. 

We  now  proceed  to  evaluate  eqs.  (3)  and  (6)  to  the  same 
order.14  For  this  purpose,  ^,k(r)  is  expanded  about  appropriate 
points  K,  using  k - g theory. The  leading  terms  are 

*ik  <£>  “ *ik  (£)  “ [ ^ t k,<£>  + 

(k-Kr)  • 2 %WnK(r)l  eW'i.  (8) 

~ ~ n#l  E|n  ~r  J 

where  we  use  atomic  units, 

Eln  - <“nKr  I £ I uIKr>  - - ><unK,,  I 7 I U1K„> 

[unk(r)  is  the  cell-periodic  part  of  fnk(r)],  and  E[n  - 
Et(K,)-En(Kr).  Our  second  approximation  is  that  F®(k)  is 
sufficiently  localized  about  k®  to  allow  the  use  of  k • g expan- 
sions about  these  points.  Such  expansions  allow  us  to  evaluate 
all  but  the  intervalley  KE  matrix  elements  to  order11  (k-k.)2 
[from  now  on  we  use  k^,  in  place  of  k®  for  clarity].  We  define 


f^(r)  - 2 F®  (fc)  ei(Ha)-£ 

*0 

(10) 

and  get 

<H>®“  - <fj  |T“  (-  i V)  | f^>. 

(11) 

and 

<U>“*  - <ff  | U“*  | f{>, 

(12) 

* awn  rwtP  i 


Page  4 


where 

T-(-i7)  --I72-  2 l£ln  -V  | 2 / H?n  . (13) 

- 2 n#l  * ~ 

[usually  written  in  the  form16 

T*(-i?)  - - 2(l/2m*)  (dVdx2)  (14) 

~ i 1 1 

with  m*  being  the  effective  masses  at  the  ath  extremum],  and 

Va*  - if/ iiyjCr)  + 

-i  W^(r)U(r)  aJi(jEln/  Hfn)^(r)].7 

+i-‘I«#l(£i’,/  E7n)  ^(£)U(£>*>V£)J-  (,5) 

The  second  and  third  terms  in  (IS),  which  have  not  been  con* 
sidered  before,  must  be  retained  because  they  are  of  the  same 
order17  as  the  second  term  in  (13),  which  is  responsible  for 
converting  the  free-electron  mass  to  m*.  Their  actual  contrib- 
ution will  of  course  depend  on  the  particular  choice  of  U(r). 
In  the  limit  of  assuming  a weak  potential  U and  dropping  all 
Umklapp19  and  intervalley  terms,11  (IS)  reduces  to  simply 
U(r)3„0,  and  the  one-band  one-valley  EM  equation11  follows. 

Finally,  for  the  intervalley  KE  matrix  elements,  which  are 
clearly  not  rigorously  zero,  expansion  of  i£Ilt(r)  about  either  k 
or  would  not  be  correct  unless  (k-k^4  and  higher-order 
terms  were  kept.  Instead,  we  note  that  the  sum  over  k is  dom- 
inated  by  values  of  k where  F“(k)  Ei(k)  F?(k)  is  large.  A 
convenient  point  of  expansion  for  is  therefore  the  midpoint 
between  k^,  and  k^,  which  we  denote  by  k^.  Using  eq.  (8) 
with  £r  ■ and  keeping  the  lowest  order,  we  get 

<H>“*  - E,^)  SaP,  a+p.  (16) 

In  Ge,  we  can  use  one-valley  envelope  functions  to  estimate 
S°^  and  find  that  <H>“^  is  of  order  10*5  meV  and  thus  totally 
negligible.  In  other  materials  with  larger  effective  masses,  the 
value  of  <H>"0  given  by  eq.  (16)  can  be  appreciable.  Higher- 
order  terms  would  then  have  to  be  included  in  eq.  (16),  which, 
in  general,  reduce  the  value  of  <H>“^.  However,  an  estimate 
of  the  (k-k,,)4  and  higher-order  corrections  to  the  intravalley 
KE  matrix  elements  shows18  that  they  are  also  of  order 
i»Ei(kA^)S°^,  where  if  is,  of  order  unity.  Therefore,  in  each 
application,  evaluation  of  <H>“8,  using  eq.  (16)  pro- 
vides an  excellent  measure  of  the  uncertainty  of  the  calcula- 
tion. Clearly,  when  it  becomes  appreciable,  the  (k-k,)4  terms 
become  important  and  a higher-order  theory  would  have  to  be 


developed  or  the  procedure  abandoned  in  favor  of  other 
methods.20*21 

We  now  turn  to  Si.  In  Fig.  1(a)  we  show  the  lowest  con- 
duction band  of  Si  along  the  (001)  direction  in  the  standard 
Brillouin  zone  centered  at  I*.  The  dashed  lines  are  a second 
band,  which  is  not  explicitly  included  in  the  derivation  of  the 
one-valley  EM  equation1 1 since  it  is  well-separated  from  the 
minimum  at  k0.  In  a multivalley  theory,  however,  it  is  the 
indirect  interband  separation  that  is  relevant,  so  that  Fig.  1(a) 
suggests  that  a two-band  formulation  may  be  necessary.  The 
importance  of  the  second  band  becomes  even  more  compelling 
if  we  view  the  Si  bands  from  a Brillouin  zone  centered  at  X,  as 
shown  in  Fig.  1(b).  It  immediately  becomes  clear  that  the 
multivalley  problem  in  Si  is  intrinsically  a two-band  problem. 

The  two-band  theory  is  developed  in  two  stages.  First, 
states  <t>a(r)  are  constructed  from  the  two  bands  around  the  ath 
X point,  Xa.  For  this  purpose,  degenerate  k«g  theory13  about 
Xa  is  employed,  keeping  terms  up  to  order  (k-Xa)2.  The  re- 
sulting dispersion  describes  the  two-valley  (divale)  structure 
more  accurately  than  individual  expansions  to  second  order 
about  the  two  minima.22  Depending  on  the  choice  of  the  two 
degenerate  Bloch  functions  at  Xa,  different  but  equivalent  EM 
equations  are  obtained.  Making  use  of  the  full  symmetry  at 
the  X point,  the  resulting  single-divale  EM  equations  are 

[Tq(-i7)I+Tc(-iV)o2  + T^(-iV)ay 

+ U“]  f“  (r)  -Ef°  (r),  (17) 

where  the  zero  of  energy  is  now  at  the  X point,  I is  the  2x2 
unit  matrix,  ay  and  az  are  Pauli  spinors,  and  f°(r)  is  a two- 
component  column  vector.23  The  remaining  quantities  are 
defined  by 

Tq  (£“2P  m2sr + ky)  + 2i(k*-x)2* 


Tc(k  -X)m±  kxky. 


T ( (k-X)  - — (VXHVX),  (20) 

where  we  have  chosen  Xa  — (001)(2#/a).  Here  mx  and  m| 
are  the  usual  transverse  and  longitudinal  effective  masses, 
respectively,  at  each  minimum,  and  mc  is  an  interband  effective 
mass  which  must  either  be  calculated  from  the  appropriate  k*£ 


expression  or  fit  to  calculated  energy  bands.18  The  operator 
U“a  is  also  a 2x2  matrix,  where  Ujj  is  an  appropriate  generali- 
zation of  eq.  (IS).  If  the  interband  KE  term  given  by  eq.  (19) 
and  all  Umklapp  terms  are  neglected,  then  ■ 6-  U(r),  the 
two  bands  are  decoupled  exactly,  and  (17)  reduces  to  two 
equations,  one  at  each  valley,  which  have  the  standard  one- 
valley  form". 

The  solutions  of  (17)  may  be  classified  as  symmetric  and 
antisymmetric  with  respect  to  z.  Symmetric  solutions  can  be 
obtained  by  taking  f“(r)  to  be  an  even  function  and  f“(r)  to  be 
an  odd  function.  For  antisymmetric  solutions  the  opposite 
choice  must  be  made.  The  difference  in  the  energies  of  the 
two  types  of  solutions  arises  mainly  from  the  fact  that,  to 

~ aa  . ^ ao  • 

leading  order,  Ut,  (r)  ~ coszXz  and  U22  (r)  ~ sinzXz.  Symme- 
tric states,  therefore,  sample  the  full  strength  of  the  potential 
near  the  origin  and  would  be  deeper  for  an  attractive  core. 

In  order  to  complete  the  theory  for  Si,  eigensolutions  <*>(r) 
are  constructed  by  writing 

3 

*(r)  -Z  \a  *"(r)  (21) 

~ a—  1 

Symmetric  <f>a’s  which  are  non-degenerate  in  the  space  of  a 
single  Xa  give  rise  to  a singlet  A(  and  a doublet  E,  whereas 
antisymmetric,  similarly  non-degenerate  <t>a’s  give  rise  to  a 
triplet  T2.24  The  three  \a  for  the  latter  are  of  the  form  0,0,1, 
so  that  a single-divale  theory  is  adequate.  For  A,  and  E states, 
multidivale  equations  are  constructed  following  the  same  pro- 
cedure used  for  the  one-band  multivalley  theory  with  obvious 
generalizations. 18 

An  alternative  form  of  the  two-band  equations  for  Si  can 
be  obtained  by  making  use  of  the  symmetry  present  along  the 
(001)  axis  instead  of  the  full  symmetry  at  X.  The  resulting 
equations  can  then  be  cast  in  a form  that  is  analogous  to  the 
one-band  multivalley  equations  derived  above  for  Ge,  except 
that  now  the  term  Tc(k-X)  of  eq.  (19)  appears  explicitly  as  the 
intervalley  KE  for  the  two  valleys  on  the  same  (001)  axis.18 
The  particular  form  of  the  one-band  multivalley  equations  used 
in  Refs.  1-4  can  then  be  recovered  by  omitting  Tc  as  well  as 
the  second  and  third  terms  in  eq.  (IS)  in  all  the  PE  matrix 
elements,  and  making  some  additional  minor  approximations.23 
The  importance  of  the  omitted  terms  and  of  the  other  approxi- 
mations depends  on  the  host  band  structure  and  on  the  choice 
of  impurity  pseudopotential.  Before  we  address  these  subjects, 
however,  we  first  complete  our  derivations  of  multivalley  EM 
equations. 
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The  two-band  multidivale  EM  equations  derived  above  for 
Si  can  immediately  be  extended  to  describe  the  camel’s-back 
extrema  of  GaP.  The  lowest  conduction  bands  of  GaP  near 
the  X point  look  like  those  of  Si  (Fig.  lb)  except  that  the 
states  at  X are  split  by  the  antisymmetric  part  of  the  crystal 
potential  giving  rise  to  a camel's-back  shape  (Fig.  lc).26  The 
appropriate  single-divale  equation  is 

[Tq(-i7)I+Tc(-iV)oz  +T/  (-i7)  +|o, 

+ U~]  f*(r)  - Ef“  (r),  (22) 

where  Tq,  Tc  and  T / are  the  same  as  in  eq.  (17),  the  a’ s are 
Pauli  spinors,  and  A is  the  band  splitting  at  X.  We  note  that 
even  if  A is  large  compared  with  EB.  and  even  if  Umklapp 
terms  are  dropped  so  that  U^  ( r ) ~ijj  U(0,  eq.  (22)  cannot  be 
decoupled  into  two  one-band  equations.27  The  generalization 
to  a multidivale  theory  is  straightforward.18 

We  turn  now  to  the  choice  of  impurity  pseudopotentials 
which  must  be  used  with  the  above  equations.  For  substitu- 
tional impurities,  one  may  write 

U(r)  - Vj  (r)  - v„  (r)  + screening,  (23) 

where  v;  (r)  and  vo(0  are  pseudopotentials  for  the  impurity 
and  host  ion,  respectively.  In  a theory  which  neglects  all  Umk- 
lapp terms,28  one  must  simply  make  sure  that  the  choice  of 
pseudopotentials  is  such  that  U(£)  is  weak  for  the  EM  approxi- 
mations to  be  valid.29  When  Umklapp  terms  are  included  in  the 
EM  equations,  however,  one  also  needs  Bloch  functions  for  the 
host  crystal.  Since  Bloch  functions  depend  on  the  choice  of 
crystal  pseudopotential,  internal  consistency29  requires  that  the 
crystal  pseudopotential  be  given  by 

V(r)  - 2 v0  (r-Rj)  + Vv  (r),  (24) 

where  v0(r-Rj)  is  the  same  as  in  eq.  (23).  Here  Vv  ( r ) is  the 
potential  of  the  valence-electron  cloud.  The  importance  of 
violating  the  requirement  of  internal  consistency  in  numerical 
calculations  has  not  been  investigated.29  The  requirement, 
however,  limits  the  usefulness  of  the  screened  point-charge 
potential  U^fr).  It  was  shown  in  Ref.  9 that  U^fr)  is  a good 
approximation  for  an  isocoric  impurity  if  one  works  within  a 
true-potential  representation  or  the  Cohen-Heine30  optimized- 
pseudopotential  representation  where  both  v0(r)  and  Vj(r) 
retain  their  Coulombic  spikes  at  the  origin.  If,  however,  the 
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host  band-structure  calculation  is  carried  out  using  a model 
potential,  as  in  the  empirical-pseudopotential  method,31  vQ(r)  is 
fixed32  and  U^fr)  cannot  be  unambiguously  identified  with  the 
impurity  pseudopotential  of  any  specific  shallow  donor.  In- 
stead, its  binding  energy  will  depend  on  the  choice  of  pseudo- 
potential used  for  the  band-structure  calculation  that  yields  the 
necessary  Bloch  functions.  For  commonly  used  choices,  the 
calculations  of  Ref.  3 suggest  that  the  resulting  binding  ener- 
gies would  lie  in  the  range  of  experimental  binding  energies  of 
shallow  donors. 

The  task  that  lies  ahead,  therefore,  is  the  construction  of 
impurity  pseudopotentials  that  are  internally  consistent  with  a 
pseudopotential  band-structure  calculation.  Evidence  that  is 
available  thus  far  indicates  that  the  task  is  very  sensitive.  The 
calculations  of  Ref.  3 (see,  e.g..  Table  1 of  Ref.  3)  show  that 
the  net  contribution  of  Umklapp  terms  in  Si  is  the  result  of 
subtle  cancellations.  Almost  the  entire  intervalley  matrix  ele- 
ment for  valleys  on  the  same  axis  comes  from  the  G»(331) 
contribution.  The  host  potential  vD  used  for  the  band-structure 
calculation,  however,  and  most  commonly  used 
pseudopotentials3 1 have  no  Fourier  components  at  such  high  G 
values.  In  fact,  in  keeping  with  the  empirical-pseudopotential 
spirit,  Shindo  and  Nara1  imposed  a cutoff  on  the  sum  over  G. 
If  a similar  cutoff  were  used  in  the  calculations  of  Ref.  3,33  the 
net  value  of  the  intervalley  matrix  element  would  be  reduced 
by  almost  an  order  of  magnitude.  Calculations  of  the  addition- 
al terms  arising  from  the  retention  of  Umklapp  contributions 
[cf.  eq.  (15)]  is  even  more  sensitive,  since  it  involves  summing 
over  bands  as  well.34  Their  net  contribution  will  again  depend 
on  the  choice  of  host  and  impurity  pseudopotentials. 

In  conclusion,  we  have  derived  rigorous  multivalley  EM 
equations  for  impurity  states  in  semiconductors.  They  can 
form  the  framework  for  accurate  calculations  if  sufficiently 
accurate  and  internally  consistent  host  and  impurity  pseudopo- 
tentials are  constructed.  If  the  latter  task  is  accomplished,  the 
calculations  would  still  be  highly  demanding.  In  the  case  of 
shallow  donors  in  Ge,  the  intervalley  matrix  elements  which 
cause  the  ground-state  splitting  are  of  order  0. 1-1.0  meV.35 
Since  these  quantities  would  have  to  be  calculated  by  summing 
over  reciprocal  lattice  vectors  and  bands,  it  appears  that  the 
theory  will  be  more  useful  for  double  donors  in  Ge  and  moder- 
ately deep  donors  in  Si  and  GaP  (EB  ~ 100-300  meV).  In  such 
cases,  one  may  have  to  include  contributions  from  subsidiary 
minima  in  a straightforward  generalization  of  the  theory,18 
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while  uncertainties  of  order  10-30  meV  would  not  be  impor- 
tant. 
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FIGURE  CAPTION 

Fig.  1.  (a)  The  standard  view  of  the  Si  conduction  bands  in 

the  Brillouin  zone  centered  at  I*.  The  dashed  curve 
is  a second  band,  which  is  neglected  in  the  one- 
valley  approximation,  (b)  View  of  the  same  bands 
from  a Brillouin  zone  centered  at  X,  showing  that 
the  problem  is  intrinsically  a two-band  problem, 
(c)  The  camel’s-back  form  of  the  bands  of  GaP. 
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The  acceptor  Hamiltonian  proposed  in  19S5  by  Kohn  and  Luttinger  is 
extended  to  introduce  the  q -dependence  of  electronic  screening  and  the 
coupling  with  the  split-off  valence  band.  Accurate  solutions  of  the  re- 
sulting equations  by  the  irreducible-tensor  technique  allow  a good  in- 
terpretation of  the  spectra  of  the  shallow  acceptors  in  Si  and  Ge. 


DURING  the  last  few  years,  the  developments  in  the 
purification  methods  of  various  semiconductors  have 
made  possible  higher  resolution  spectroscopy  of  im- 
purities. The  shallow  acceptors  in  Si  and  Ge  have  received 
particular  attention  [1,2]  and  the  spectral  position  of 
many  excited  states  has  been  determined  with  accuracy 
of  0.01  meV.  On  the  theoretical  side,  even  though  the 
shallow-impurity  problem  has  been  formulated  over 
twenty  years  ago  by  Kittel  and  Mitchell  [3]  and 
Luttinger  and  Kohn  [4] , good  solutions  to  the  proposed 
effective  mass  equations  do  not  exist  yet.  The  presently 
available  theoretical  results  [5]  have  qualitative  value 
only  and  are  not  adequate  for  the  interpretation  of  the 
recent  high  resolution  data  [1,2]  and,  sometimes  they 
even  fail  in  the  description  of  the  less  accurate,  older 
data  [6-8].  For  example,  the  excited  acceptor  states  in 
Ge  [6] , the  large  ionization  energy  of  acceptors  in  Si  [7] 
and  the  existence  of  an  excited  state.  23 .4  meV  above  the 
ground  state,  observed  in  the  Raman  spectroscopy  of 
Si : B [8] , are  not  yet  understood.  These  difficulties 
have  often  been  considered  as  failures  of  the  effective 
mass  approximation. 

In  this  letter  we  report  accurate  solutions  of  the 
acceptor  effective-mass  Hamiltonian,  and  show  that  how, 
after  proper  inclusion  of  q -dependent  screening  and 
coupling  with  the  split-off  valence  band,  many  of  the 
difficulties  mentioned  above  are  lifted.  The  present 
results,  which  have  been  obtained  by  the  “irreducible- 
tensor  technique”  [5],  can  be  considered  for  all  practi- 
cal purposes  as  the  “exact”  solution  of  the  effective- 
mass  acceptor. 

The  effective-mass  Hamiltonian  for  acceptors  in 
semiconductors  can  be  written  as  [5,9] : 


X j [/*>  x /«>]  J + [/X*)  X /«>]  S + [/**>  X /«>]  «_4J 


0) 


This  Hamiltonian  includes  coupling  with  the  split-off 
valence  band  and  q -dependent  diagonal  dieletric 
screening.  In  (l),p  denotes  the  hole  linear  momentum 
operator,  / is  the  angular  momentum  operator  corre- 
sponding to  spin  1,  and  £ is  the  spin  $ of  the  hole.  The 
definitions  for  the  tensor  operators  Pa>,  Ii2>  and  their 
products  are  the  same  as  in  [5].  The  parameters  p and 
S describe  the  energy  dispersion  of  the  holes  near  the 
center  of  the  Brillouin  zone  and  are  simply  related  [5] 
to  the  Kohn  and  Luttinger  parameters  and  y3. 

The  quantity  2 is  the  valence-band  spin-orbit  splitting 
measured  in  units  of  the  effective  Rydberg  where  e_is 
the  high  frequency  dielectric  constant  of  the  host  crystaL 
Finally  e(q)  is  the  q-dependent  dielectric  function 
which,  for  convenience,  in  actual  calculations,  has  been 
expressed  as  [10]. 


a-ii 


aa* 

B,  + < I1 


(2) 


where  the  parameters  Af  and  in  (2)  are  chosen  so  that 
expression  (2)  fits  the  computed  values  of  e(q). 

Hamiltonian  (1)  has  been  solved  with  the  variational 
technique  using  symmetrized  combinations  of  the  basis 
functions: 


(3) 


3h> 


which  transform  like  a specific  row  of  any  given  rep- 
resentation of  the  symmetry  group  Of,  of  the 
Hamiltonian.  The  angular  and  spin  dependent  factor  in 
(3)  is  defined  as  in  Edmonds  [1 1].  L is  the  orbital 
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Fig.  1 . Comparison  between  theoretical  and  experimental  acceptor  states  in  Si  and  Ge.  The  experimental  data  are 

those  referenced  in  text. 


Table  1.  Ionization  energy  for  single  acceptors  in  silicon 
using  a q dependent  and  a q independent  screening.  The 
dielectric  ftinction  used  is  that  labelled  c in  [10].  The 
results  using  various  truncations  to  the  solution  are  shown 


Tm mx 

Binding  energy 
(meV) 

Main 

Split-off 

q independent  q dependent 

screening 

screening 

2 

0 

33.3 

37.6 

4 

0 

33.8 

38.5 

6 

0 

33.9 

38.7 

8 

0 

33.9 

38.7 

2 

2 

37.8 

46.7 

4 

4 

44.3 

63  3 

6 

6 

44  3 

73.2 

8 

8 

44.4 

74  3 

angular  momentum  and  I + S can  assume  the  values 
3/2  or  1/2  which  correspond  to  the  main  and  split-off 
valence  bands,  respectively.  This  shows  how  our  pro- 
cedure incorporates  the  coupling  between  different 
valence  bands.  The  infinite  set  of  basis  functions  is 
truncated  by  including  all  possible  functions  (3)  with 
L < L — . The  radial  functions  /J(r)  are  expressed  as 
linear  combinations  of  a sufficiently  large  set  of  expo- 
nential functions  with  fixed  exponents.  Finally,  the 
matrix  elements  of  Hamiltonian  (1)  between  the  basis 
functions  (3)  are  obtained  using  the  “reduced-matrix- 
element  technique”  [11]  and  their  actual  evaluation  is 
done  by  computer. 

The  spherical  model  previously  proposed  by  the 
authors  [5]  considers  the  angular  momentum  Fas  a good 


Fig.  2.  Lowest  acceptor  states  in  silicon  with  symmetry 
r*.  r;,  r;,  r?  and  r*  as  function  of  the  spin-orbit  split- 
ting A.  The  vertical  arrow  indicates  the  electronic  tran- 
sition observed  in  the  Raman  spectrum  of  Si : B. 

quantum  number,  and  corresponds  to  including  in  the 
basis  functions  only  those  orbital  angular  momenta  L 
which  are  compatible  with  a given  F.  The  resulting  basis 
set  contains  at  most  two  functions  with  J = 3/2  and  one 
with  J**  ill.  The  small  number  of  basis  functions  makes 
the  spherical  model  very  simple  and  attractive,  but 
generally  limits  its  validity  to  qualitative  predictions  only. 
The  quantitative  interpretation  of  experimental  data  re- 
quires the  complete  procedure  reported  in  this  letter. 
Convergence  tests,  which  will  be  reported  elsewhere, 
show  that  the  choice  of  LmtK  * 7 is  sufficient  to  provide 
results  with  1%  accuracy.  This  choice  implies  a basis  set 
of  20-30  basis  functions  as  defined  in  (3).  We  have 
applied  the  above  procedure  to  Si  and  Ge.  These 
materials,  for  which  high  resolution  acceptor  spectra  are 
available  [1,2]  and  for  which  the  parameters  7i , 7j , 7j  , 
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and  e_  are  known  very  accurately,  [10]  are  the  ideal 
ground  for  comparison  between  theory  and  experiment. 
Figure  1 gives  our  theoretical  predictions  for  the  excited 
acceptor  states  in  Si  and  Ge  together  with  available  ex- 
perimental data. 

The  results  represent  the  first  accurate  identification 
of  the  nature  and  symmetry  of  the  final  :tates  of  the 
experimentally  observed  infrared  transitions.  The  rel- 
evance of  the  two  new  features  introduced  in  the  present 
treatment,  i.e.  q -dependent  dieletric  screening  and 
coupling  with  the  split-off  band,  is  illustrated  in  Table  1 
for  the  acceptor  ionization  energy  in  Si.  The  results 
shown  in  the  table  explain  why,  up  to  now,  all  effective 
mass  calculations,  which  have  neglected  one  or  both  of 
the  above  effects,  did  not  predict  binding  energies  of 
acceptors  in  Si  larger  than  54meV  [10],  while  the  exact 
solution  of  the  effective -mass  Hamiltonian  gives  a much 
larger  value.  The  value  74  meV  reported  in  Table  1 was 
obtained  using,  for  the  dielectric  constant,  the  set  of 
parameters  called  c in  [ 10].  Using  the  set  of  parameters 
labelled  a and  b in  [ 10) , we  obtain  94  and  1 30  meV 
respectively.  The  calculated  value  for  the  binding 
energy  is  quite  sensitive  to  the  choice  of  the  dieletric 
screening  and  an  accurate  determination  of  e(q)  is 
necessary.  The  experimental  ionization  energies  for 
acceptor  is  Si  are  44.4, 68.9, 72.7  and  156.2  for  B,  Al, 
Ga  and  In,  respectively.  The  calculated  binding  energy 
should  be  compared,  as  shown  for  donors  by  Pantelides 
[12],  to  the  isocoric  acceptor  in  Si,  i.e.  Al,  and  we  see 
that  the  theoretical  value  is  considerably  higher  than  the 
observed  value.  For  acceptors  in  Ge  the  effective  mass 
predicts  for  the  ionization  energy  1 1 .2  meV  and  the  ex- 
perimental binding  energy  for  Ge  : Ga  is  1 1 .3  meV.  We 
emphasize  that  screened  coulomb  potentials  give  a good 
description  of  isocoric-acceptor  ground  states  only  when 
accurate  effective-mass  solutions  are  used.  Preliminary 
calculations  of  the  acceptor  binding  energy  in  GaAs, 
indicate  that  the  effective-mass  prediction  falls  in  be- 
tween the  ionization  energies  of  the  isocoric  acceptors 
Ge  and  Zn  substituting  the  anion  and  cation,  respect- 
ively. A more  precise  prediction  on  this  point,  requires  a 
better  determination  of  the  band  parameters  for  this 
semiconductor  and  an  accurate  treatment  of  polaron 
effects. 

A rather  old  puzzle  in  the  interpretation  of  acceptor 
spectra  in  semiconductors  is  a mysterious  line  observed 
in  the  Raman  spectrum  of  Si : B [8]  and  of  various 
acceptors  in  GaP  [13].  The  difficulty  in  the  interpretation 
of  this  line  is  that  the  final  electronic  state  involved  in 
the  optical  process  is  at  lower  energy  than  any  of  the 
states  seen  in  infrared  spectra,  whereas  existing  theor- 
etical models  do  not  predict  any  state  in  this  energy 
range.  Our  calculation  for  acceptors  in  Si  predicts  the 
existence  of  two-fold  degenerate  H state  23.8  meV 


above  the  ground  state.  The  experimental  Raman  line  is 
at  23 .4  me V and  degeneracy  and  symmetry  of  the  final 
state  are  consistent  with  our  identification.  The  pre- 
dicted r?  state  contains  a strong  mixture  of  d- like  orbital 
state  of  the  7 = 3/2  valence  band  and  s-like  states  of  the 
split-off  band.  The  low  energy  position  of  this  state  is  a 
consequence  of  the  small  spin-orbit  splitting  in  Si.  In 
fact,  as  shown  in  Fig.  2,  in  the  limit  of  vanishing  spin- 
orbit  splitting,  the  r?  state  would  be  degenerate  with 
the  acceptor  ground  state,  and  for  large  values  of  the 
spin-orbit  splitting,  higher  than  the  p-Uke  states.  The 
analysis  of  the  wave  function  of  the  r?  state  shows  57% 
s-like  character  from  the  split-off  band,  3 1%  4-like 
character  from  the  / = 3/2  valence  band,  and  the  re- 
maining from  higher  angular  moments  in  the  two  bands. 
The  large  s-like  component  from  the  split-off  band, 
explains  the  strong  intensity  of  the  observed  Raman  line. 
It  is  important  to  note  that,  even  though  the  binding 
energy  for  the  ground  state  in  silicon  strongly  depends 
on  the  choice  of  the  dielectric  function  used,  the  dis- 
tance of  the  Raman  level  from  the  ground  state  is 
remarkably  insensitive  to  its  choice.  In  fact  the  calcu- 
lated Raman  level  shifts  by  approximately  the  same 
amount  (within  0.1  meV)  as  that  of  the  ground  state, 
thus  maintaining  the  agreement  reported  above.  In 
addition.  Fig.  2 shows  that  the  effect  of  the  split -off 
band  on  the  acceptor  ground  state  is  large  even  for  large 
spin-orbit  splittings.  Furthermore  even  the  excited  states 
are  affected  by  the  spin-orbit  splitting  in  Si,  and  the 
acceptor  spectrum  looks  very  different  in  the  limit  of 
small  and  large  spin-orbit  splitting. 

In  conclusion,  we  have  shown  that  it  is  possible  to 
obtain  a good  interpretation  of  the  acceptor  states  in 
semiconductors  within  the  effective  mass  approximation 
provided  an  accurate  solution  of  these  equations  is  ob- 
tained. This  means  that  for  s-like  states  one  has  to  in- 
clude in  the  basis  set  expansion  terms  up  to  L - 6,  and 
up  to  L - 7 for  p-Iike  states.  The  inclusion  of  the  split- 
off  band  is  very  important  and,  when  considered,  it 
allows  an  understanding  of  the  acceptor  Raman 
spectrum  in  silicon.  Finally,  the  problem  of  an  accurate 
determination  of  the  ground  state  binding  energy  is  still 
open.  We  have  seen  that  small  variations  in  the  dielectric 
function  for  silicon  produce  large  effects  in  this  quantity . 
For  the  first  time,  however,  the  theoretical  binding 
energy  is  larger  than  the  experimental  isocoric  value. 

This  is  very  encouraging  since  the  use  of  pseudopotentials 
(weaker  than  real  potentials)  should  now  be  able  to 
provide  a better  description  of  the  isocoric  acceptor  and 
of  the  dependence  of  the  acceptor  binding  energy  with 
the  impurity  atom. 


*This  work  was  supported  in  part  by  the  AFOSR  under 
contract  F49620-77-C-005. 
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Ike  bunding  energy  of  donor*  w docribad  by  • ample  mode  Hemiltonien  in  • nugneric  Bad  of  arbitrary 
strength  is  czloiburi  nsing  an  aspansinr  of  the  wave  function  in  Urns  of  spherical  hannonics.  All  the  matrix 
etomants  are  calculated  analytically  and  the  Hamihonias  it  solved  accurately.  We  than  calculate  the 
polarizability  by  applying  a vary  small  electric  Said.  Results  are  given  for  P,  As,  and  Sb  donors  in  silicon 
and  arc  discussed  with  reference  to  recent  experimental  data  by  Centner  and  Lea. 


L INTRODUCTION 

Very  recently,1  Caetner  and  Lee  have  performed 
static  (actually  low-frequency)  magneto-capaci- 
tance measurements  at  lour  temperatures  in  Si 
with  low  concentration  (NB)  of  donor  impurities, 
and  have  interpreted  their  results  as  indicating  a 
reduction  in  the  polarizability,  a(H),  of  weakly 
interacting  donors  by  a static  magnetic  field. 

Theoretically,  even  though  a great  deal  of  work 
has  been  done  regarding  the  energy  of  atoms  and 
donor  impurities  in  the  effective-mass  approxima- 
tion (EMA)**4  in  the  presence  of  a magnetic  field, 
not  much  exists  regarding  optical  properties,  such 
as  dipole-matrix  elements,  oscillator  strengths, 
etc.1  In  a previous  paper,*  one  of  us  (D.L.D.)  has 
calculated  the  magnetic -field  dependence  of  the 
polarizability  a for  a hydrogenic  atom  in  a weak 
magnetic  field.  A simple  analytical  expression 
tor  the  ground-state  wave  function  in  the  pres- 
ence of  a magnetic  field  was  obtained  by  a varia- 
tional method.  The  effect  of  an  electric  field,  to 
determine  the  polarizability,  was  then  calculated 
using  a variational  method  introduced  by  Hassd.T_ 
Analytical  expressions  for  a„  (E|H)  and  ax  (EJ-H) 
were  given.  These  analytical  expressions  are  how- 
ever quite  complicated,  the  variational  minimiza- 
tion procedure  in  the  presence  of  the  external 
field  is  nonlinear,  and  terms  of  higher  order  than 
H*  were  impracticable  to  consider.  In  addition, 
the  calculation  was  performed  for  hydrogenic 
atoms  only,  so  that  no  Impurity  dependence  in  a 
could  appear. 

The  treatment  of  the  energy  of  donor  impurities 
in  multivalley  semiconductors  is  quite  complica- 
ted, 4-10  and  very  recently  new  complications  have 
been  pointed  out.11  The  inclusion  of  external  fields 
would  make  a precise  treatment  of  the  problem 
even  more  difficult,  and  is  beyond  the  scope  of 
the  present  investigation.  In  this  paper  we  intro- 
duce an  approximation  to  describe  the  spatial  de- 
pendence of  the  ground  state  of  the  different  do- 
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nors;  in  view  of  the  crudeness  of  this  approxima- 
tion the  results  must  be  regarded  with  reservation. 

First,  we  describe  a method  which  provides  a 
good  description  of  the  polarizability  of  the  hydro- 
genic atom  for  magnetic  field  of  arbitrary  strength. 
Section  II  describes  this  method  and  the  solution. 

In  Sec.  in  we  introduce  a simple  model  Hamilto- 
nian for  donors.  An  impurity -dependent  potential 
is  introduced  and  fixed  so  as  to  give  the  measured 
donor  binding  energy  in  the  absence  of  an  external 
field.  Finally,  in  Sec.  IV  the  results  for  the  polar- 
izability are  presented  and  discussed  using  avail- 
able experimental  data. 

U.  HYDROGENIC  ATOM:  METHOD  AND  SOLUTION 

The  EMA  leads3  to  the  following  Schrodlnger 
equation  for  a hydrogenic  impurity  in  a uniform 
magnetic  field  Ut , in  the  z direction: 

3^«-v*-2/r+±y*(x*+/).  (1) 

Here  we  use  the  quantities 

o*»«**/m*e3,  R*-M*e*/2e*K*, 

y*(.2KiU,/tn**oacw«Mlft/2mmcR* , 

as  the  units  of  length,  energy,  and  magnetic  field; 
y * 1 is  the  magnetic  field  at  which  the  diamagnetic 
energy  is  equal  in  magnitude  to  the  Coulomb  ener- 
gy. [For  the  free  hydrogen  atom  y = 1 would  cor- 
respond to  Ht= 2.35x10*  G,  but  for  a donor  im- 
purity it  would  be  reduced  by  a factor 
- 10*3  or  less].  < is  the  static  dielectric  constant, 
and  m*  the  (scalar)  effective  mass.  For  purposes 
of  comparison  with  experiment  we  choose  m* 

• 0. 3m0  and  c « 11.4.  (The  value  m* • 0. 3m0  has 
been  chosen  so  as  to  give  the  same  binding  energy 
as  that  obtained  in  the  single  valley  case  with  an- 
isotropic masses.)  In  writing  (1)  we  have  omitted 
relativistic  terms,  which  yield  a spin-dependent 
polarizability.  These  terms  have  been  shown  to 
be  negligible  for  most  purposes.*  To  calculate 
the  polarizability  we  introduce  an  additional  elec- 
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trie -field  term 

3C,*  -e«*.E,f /!!•■  - {I , (3) 

where  I stands  for  a or  x,  parallel  or  perpendicu- 
lar to  the  direction  of  the  magnetic  field  If.  The 
Hamiltonian  in  the  presence  of  an  external  magne- 
tic and  electric  Held  is  therefore  given  by 

K-JP.+3C,.  (4) 

The  polarizability  a is  defined  by 

SUf.,  «)-*<*., 0)-* a** , (5) 


*»|s— |&}j£t.-|Vfs r 


r,l-r\ 

vT  ' 


Depending  on  die  direction  o££  we  obtain  the  com- 
ponents ax  if  tJL,  and  on  if  { |ff. 

We  now  seek  the  eigenfunctions  of  the  Hamilto- 
nian in  Eq.  (4).  We  write  the  solution  <p(x)  as 


i.e.,  we  expand  it  in  terms  of  spherical  harmonics. 
In  the  absence  of  any  external  field,  l and  m are 
good  quantum  numbers  and  therefore  there  would 
be  only  one  angular  term  in  the  expansion  (7). 

Once  the  magnetic  field  is  introduced,  I is  no 
longer  a good  quantum  number  but  m and  parity 
still  are.  Therefore  for  the  ground  state,  the  sum 
in  (7)  will  contain  only  terms  with  fixed  m and  even 
l . When  the  electric  field  is  turned  on,  parity  is 
no  longer  a good  quantum  number  and  m will  still 
be  if  the  electric  field  lies  in  the  same  direction 
as  the  magnetic  field,  otherwise  even  m is  no 
longer  a good  quantum  number  and  the  sum  in  (7) 
will  contain  terms  with  any  l and  m.  In  order  to 
evaluate  the  expectation  value  of  the  Hamiltonian 
(4),  we  must  calculate  its  matrix  elements  in  the 
basis  (7).  To  take  full  advantage  of  the  present 
method,  it  is  convenient  to  rewrite  (4)  in  terms 
of  Irreducible  components  of  tensor  operators 
whose  effect  on  the  basis  functions  (7)  can  be  cal- 
culated using  standard  angular-momentum  rela- 
tionships.1* Equation  (1)  can  be  rewritten  as 

Xom  -V*-2/r+  n Y*(2r*  -/T  Tj),  (8) 

where  T'm  represents  the  mth  irreducible  compo- 
nent of  the  tensor  of  rank  1 , as  defined  by  Ed- 
monds.1* Obviously  here 

r*-/T(2r*-x*-y*)-^r«vri,0.  («) 

The  electric  field  term  can  be  written 

(10) 


The  following  relationships  can  therefore  be  used 

(w*4]/(r)rl#.-r,,. 

JV(r)r,  (2T  ♦ 1)(3I  ♦ 1)]  »'• 

J2  L L'\/»  L L\ 

\0  m m7\0  0 0/ 
(-D-'r,.... . (13) 

r >/(r)T, . „ -/(r)r  £ [ (21'  ♦ l)(2f  * 1)]  */» 

X/1  L ,'V  1 I V\ 

\m"  m iw'/V  0 0 0/ 

(-iKr,w.  (14) 

Using  these  relationships  we  can  treat  exactly  and 
analytically  the  angular  part  of  the  matrix  elements 
of  (4)  in  the  basis  (7).  One  is  therefore  left  with 
systems  of  radial  differential  equations  tor  the 
/ 1 (r).  These  systems  can  be  solved  very  accu- 
rately by  making  the  following  anaatz, 

M 

/iW-ECue-V.  (15) 

/■i 

The  eigenvector  coefficients  C, ,,  and  the  corres- 
ponding eigenvalues  are  obtained  by  solving  the 
secular  determinant  of 

'ECtl(Mti-KSti)-0  . (18) 

The  else  of  the  secular  determinant  in  (16)  is  If 
x if,  where  N is  the  number  of  terms  in  (7)  and 
M the  number  of  terms  in  (18).  The  parameters 
Vj  are  not  treated  as  variational  parameters.  With 
a sufficiently  large  number  of  terms  (if)  describing 
the  physically  interesting  spatial  range  of  the  wave 
function,  the  specific  choice  of  the  q,’s  does  not 
matter,  as  discussed  more  extensively  in  Sec.  IV. 
We  also  note  that  It  is  not  necessary  to  include 
any  powers  of  r with  each  t"1*11*1*1  to  (15). 


ID.  MODEL  HAMILTONIAN  POX  DONOR  NPURITIES 


The  first  successful  theoretic.'!  inrsstlgntlon  of 
donor  spectra  to  semiconductors  was  given  by  Kahn 
and  Lutttnger1*  using  the  EMA.  in  indirect -gap 
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TABLE.  L.  Values  of  the  parameter  K (In  atomic  units), 
which  are  obtained  by  requiring  that  the  calculated  donor 


binding  energies  equal  the  experimental  ones.  We  have 
used  « » 11.4  and  m*  - 0.3m0.  The  energy  unit  la  meV. 

Material 

K 

£»  (calc.) 

E%  (expt.) 

Hydrogenic 

40 

31.41 

P 

0.S74 

45.45 

45.47 

Aj 

0.515 

53.70 

53.60 

Sb 

0.608 

42.68 

42.69 

materials,  the  multivalley  nature  of  the  conduction 
band  gives  rise  to  a multiple!  structure  of  the  do- 
nor-impurity ground  state,  because  the  intervalley 
components  at  the  impurity  potential  remove  the 
degeneracy  between  the  zero-order  states  obtained 
independently  for  each  valley.*  This  “valley -orbit” 
interaction  is  quite  substantial,  as  it  Increases  the 
binding  energy  of  the  ground  state  by  about  35%  for 
P in  Si,1*  and  should  therefore  be  included  in  the 
theoretical  description  of  these  states.  An  accu- 
rate description  of  this  effect  is  very  difficult, 
however,  as  has  long  been  appreciated  and  re- 
cently emphasized  anew.11  Furthermore,  the 
donor-binding  energy  depends  on  the  given  impuri- 
ty, e.g.,  P,  Sb,  and  As  donors  in  Si  have  different 
binding  energies.1*  A proper  analysis  of  these 
“chemical  shifts”  is  not  yet  available,  and  we  do 
not  attempt  one  here.  Instead,  we  use  the  sim- 
plest formulation  possible  which  Incorporates  in 
some  average  way  the  effects  described  previously. 
We  do  so  by  introducing  a phenomenological  im- 
purity dependent  potential  of  the  form 

TABLE  □.  Energy  of  the  hydrogen  ground  state  ae  a 
function  of  the  magnetic  field.  Comparison  with  Cab  lb 
tt  Ml.  (Ref.  2)  la  also  shown.  The  energy  unit  Is  the  ef- 
fective Rydberg. 


y 

Mu  (present) 

Ru  (Cablb  st  si.) 

0.0 

-1.00000 

-1.00000 

0.1 

-0.99608 

-0.99808 

0.2 

-0.98076 

-0.98076 

0.3 

-0.96037 

-0.96841 

0.4 

-0.92921 

-0.92923 

0.5 

-0.69442 

-0.89447 

0.6 

-0.86492 

-0.06494 

0.7 

-0.81145 

-0.811 42 

0.8 

-0.76457 

-0.764  57 

0.9 

-0.71474 

-0.71473 

1.0 

-0.66234 

-0.66241 

1.5 

-0.37071 

-0.37076 

2.0 

-0.04443 

-0.044  50 

2.5 

0.30493 

0.304  90 

3.0 

0.67094 

0.670  87 

4.0 

1.438  4 

1.438  4 

5.0 

2.2392 

2.2392 

TABLE  m.  Ground-state  binding  energy  for  P,  Aa, 
and  Sb  donor  Impurities  in  Si,  using  the  model  potential 
described  In  Sec.  m.  The  energy  unit  Is  the  effective 
Rydberg. 


y 

p 

As 

Sb 

0.0 

1.44729 

1.70983 

1.35908 

0.2 

1.437 10 

1.702  09 

1.34775 

0.4 

1.40821 

1.67979 

1.31866 

0.6 

1.36401 

1.64495 

1.26753 

0.8 

1.307  69 

1.59970 

1.20641 

1.0 

1.241  70 

1.545  83 

1.13523 

2.0 

0.817  36 

1.18807 

0.68282 

3.0 

0.30128 

0.74047 

0.13800 

4.5 

—0.26573 

0.24107 

-0.457  52 

5.0 

-0.865  67 

-0.292  55 

-1.08562 

10.0 

-4.12349 

-3.237  00 

-4.47914 

V(r)« 

-(sVcrH  l+(«- 

iv ")  • 

(17) 

The  coefficient  K determines  the  strength  of  the 
screening  of  the  potential  V.  In  fact  when  A— “ 
Eq.  (17)  reproduces  the  static  screened  potential, 
while  for  K - 0 it  gives  the  unscreened  Coulomb 
potential.  The  value  of  K for  each  Impurity  is  de- 
termined by  requiring  the  calculated  binding  en- 
ergy to  agree  with  the  observed  one  for  that  im- 
purity. The  use  of  this  potential  does  not  compli- 
cate the  solution  of  the  problem  because  it  does 
not  involve  any  new  type  of  integrals.  In  Table  I 
we  give  the  values  of  K for  As,  P,  and  Sb  in  sili- 
con. 


FIG.  1.  Plot  of  the  calculated  radial  part  of  the  ground- 
state  wave  function  [actually  r */(r)]  as  a function  of  r 
in  the  absence  of  applied  fields  for  various  Impurities 
in  St. 
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FIG.  2.  Hot  of  tho  expectation  value  of  r,  (»7  , aa  a 
function  of  reduced  magnetic  field  for  H,  P.  Aa.  and 
9b  tmpurtttea  In  SI. 
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FIG.  3.  CalculMed  magnitude  of  the  radial  wave 
function  a the  origin  for  various  tmpuritlea  la  St  aa  a 
function  of  reduced  magnetic  field. 


IV.  NUMERICAL  RESULTS  AND  DISCUSSION 

We  first  address  Die  question  of  convergence  of 
the  binding  energy  as  a function  of  the  basis  func- 
tions in  (15).  In  principle,  the  exponents  »j4  should 
be  allowed  to  change  in  a variational  way  and  to 
depend  on  any  given  state.  In  practice,  however, . 
as  we  have  previously  seen,"  the  tj,  can  be  fixed 
and  not  considered  as  variational  parameter,  if 
one  chooses  a sufficient  number  of  terms  in  (15) 
and  if  the  jj,’  s are  selected  in  a sensible  way.  We 
have  seen  that  it  suffices  (1)  to  include  15  terms  in 
(15),  and  (ii)  to  choose  the  largest  and  smallest 
i)t  to  be  50  and  0.05,  respectively,  with  the  inter- 
mediate ones  chosen  in  geometrical  progression, 
to  insure  convergence  in  eigenvalues  to  better  than 
one  part  in  10*.  The  next  question  regards  how 
many  terms  should  be  included  in  (7).  This  num- 
ber obviously  increases  with  the  strength  of  the 
magnetic  field.  In  Table  a the  ground-state  bind- 
ing energy,  using  six  terms  in  (7),  is  given.  In 
the  last  column  the  results  of  Cablb  «t  al .*  (OFF) 
are  also  given  for  comparison.  We  see  that  our 
results  reproduce  accurately  the  sophisticated 
calculations  of  OFF  for  y’ s up  to  5.  We  have  also 
compared  our  results  for  y up  to  100  with  the  re- 
sults of  Baldereschi  and  Bassani*  and  the  agree- 
ment is  excellent,  thus  showing  that  the  present 
method  can  handle  any  value  of  the  magnetic  field. 
In  all  the  subsequent  calculations  we  shall  always 
use  up  to  10  (i.e.,  six  terms)  in  the  expres- 
sion. In  Table  III  we  report  the  magnetic  field  de- 
pendence of  the  ground  state  binding  energy  for 
P,  As,  and  Sb  in  Si.  respectively,  using  the  model 
potential  described  in  the  previous  section.  In 
Fig.  1 we  show  the  ground -state  wave  function  In 


the  absence  of  any  external  field  for  the  four  eases 
considsred  in  this  pspsr.  The  expectation  value 
at  r for  the  ground  state,  as  a function  at  the  re- 
duced magnetic  field,  is  shown  In  Fig.  2,  for  the 
various  impurities  while  the  value  at  the  origin, 

| #<0)|  Is  given  in  Fig.  3. 

To  determine  the  polarizability  a we  calculate 
the  binding  energy  using  a very  small  value  for 
the  electric  field  and  then  use  expression  (6).  To 
ensure  that  the  electric  field  Is  indeed  "Infinitesi- 
mal, " that  is,  only  virtually  perturbs  the  wave 
functions,  we  have  performed  calculations  with  ( 

- 0.05, 0.025,  0.01  with  essentially  the  same  results 
(to  four  significant  figures).  The  introduction  at 


FIG.  4.  Calculated  polarlzdslltty  components  In 
effective  unite  as  a function  of  y*  for  the  hydrogenic 
impurity  In  SI. 


FIG.  S.  Same  as  Fig.  4 for  the  P impurity. 

the  electric  field  requires  the  odd  L terms  in  ex- 
pression (7)  and  in  the  calculation  we  hare  used 
all  odd  terms  up  to  LM+ 1.  For  U.  * 0,  the  cal- 
culated polarizability  a for  hydrogen  can  be  com- 
pared with  the  exact  value. M We  obtain  for  a a 
value  which  compares  very  well  (i.e.,  to  four  sig- 
nificant figures)  with  the  exact  value  | . In  Fig. 

4 we  show  the  magnetic  field  dependence  of  ax  and 
<«n  . These  results  compare  reasonably  well  with 
those  obtained  in  a previous  paper  in  the  low  y 
region  by  one  of  us  (D.L.O.),  and  in  addition,  they 
are  valid  also  for  large  y. 

In  Figs.  5-7  the  polarizabilities  for  P,  As  and 
Sb  are  given  and  we  see  that  the  magnetic -fie Id 
dependence  gets  weaker  with  increasing  binding 
energy.  This  is  reasonable  since  the  wave  func- 
tion becomes  more  localized  with  increasing  bind- 
ing energy  and  therefore  the  external  fields  are 


r* 

FIG.  7.  Same  as  Ftg.  4 for  Sb  impurity. 


less  effective  in  perturbing  the  wave  functions. 

The  comparison  with  experiment  is  to  be  per- 
formed with  caution  since  we  have  used  here  a 
crude  approximation  for  the  description  of  the 
donor-impurity  dependence.  The  calculated  de- 
crease of  a for  U,  * 0 in  going  from  Sb  to  As  is  in 
agreement  with  observation.  In  fact,  we  predict 
11  x ltPaJ  for  Sb,  8.2  x llPaJ  for  P,  4.9x  10*0$  for 
As,  while  the  experiment  gives  21  x 10*0*,  16 
x 10!a$,  and  6.7xio5aJ,  respectively.  Also  in 
agreement  with  experiment  is  the  weaker  depen- 
dence of  a with  magnetic  field  in  going  from  Sb  to 
As.  The  calculated  absolute  value  of  these  depen- 
dences however  is  quite  different  from  the  ob- 
served ones.  Specifically,  the  calculated  depen- 
dences are  much  weaker  than  for  the  hydrogenic 
atom  which  in  turn  are  much  weaker  than  the  ex- 
perimental ones.  Even  though  these  results  are 
obtained  using  a crude  approximation,  the  fact  that 
the  experimentally  observed  dependence  of  a on 
is  much  larger  than  even  the  hydrogenic  case  is 
puzzling.  Ideally  one  should  perform  calculations 
using  a more  realistic  model  for  the  donor  Impuri- 
ties which  include  a proper  description  of  the  elec- 
tron anisotropy,  intervalley  coupling,  and  central- 
cell effects  but  in  fact  one  would  expect  that,  since 
the  observed  binding  energies  for  the  various  do- 
nors are  always  greater  than  for  the  hydrogenic 
case,  any  calculation,  even  a more  sophisticated 
one,  would  give  the  same  qualitative  results  as 
those  obtained  here.  More  experimental  informa- 
tion Is  also  required  to  interpret  the  disagreement 
between  theory  and  experiment. 
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A theory  foT  Wannier  exdtons  in  a magnetic  field  of  arbitrary  strength, 
including  both  anisotropy  and  exciton— phonon  coupling,  is  presented. 
Both  the  anisotropy  and  the  exciton— phonon  coupling  are  shown  to 
considerably  quench  the  shifts  of  the  energy  levels  in  the  magnetic  field. 
Results  are  presented  for  Pbl?  and  compared  with  recent  experimental 
data. 
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THE  INTEREST  in  layer  structure  materials  has 
increased  appreciably  during  recent  years  [1  ] . Optical 
properties,  in  particular,  have  been  studied  in  order  to 
better  understand  the  electronic  structure.  One  of  the 
most  studied  cases  has  been  Pbla  in  which  band-edge 
exdtons  have  been  observed  by  several  investigators  [2] . 
Theoretical  interpretations  of  the  experiment  differ 
considerably  with  regard  to  both  the  nature  of  the  states 
responsible  for  the  observed  lines  and  the  ground  state 
binding  energy  for  which  estimates  range  from  16  meV 
[3]  to  180  meV  [4] . Recently,  in  an  effort  to  obtain 
more  information  about  the  exdton  spectrum,  its 
variation  with  an  applied  external  magnetic  field  has 
been  studied  [4] . Unexpectedly,  the  exdtonic  lines  did 
not  show  any  shifts. 

In  this  letter  we  show  theoretically  that  the  absence 
of  shifts  is  a direct  consequence  of  the  exdton— phonon 
interaction  and  the  anisotropy.  On  the  basis  of  our 
results  we  condude  that  previously  used  isotropic 
descriptions  of  the  exdton,  which  neglect  either  of  the 
above  two  effects,  are  inadequate  in  polar,  layered 
materials.  We  present  the  first  theory  which  incorporates 
both  the  exciton— phonon  interaction  and  the  anisotropy 
and  which  is  valid  for  arbitrary  magnetic  field  strength. 
Both  the  exciton  ground  and  exdted  states  are  calculated 
accurately.  The  available  experimental  data  for  Pbla  are 
interpreted. 

In  [5]  it  was  shown,  that  the  interaction  of  LO- 
phonons  with  an  isotropic  exciton  can  be  described  by 
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spherically  symmetric,  exdton-radius-dependent  screen- 
ing of  the  electron— hole  Coulomb  interaction.  The 
generalization  of  this  result  to  anisotropic  materials  is 
extensively  discussed  in  {6] , where  it  is  shown  that  for 
not  too  large  anisotropies  (0  < a < 0.6)  one  can  obtain 
reliable  energies  using  the  spherically  symmetric  screen- 
ing function  in  which  averaged  dielectric  constants  and 
masses  are  used.  The  results  of  [5]  and  [6]  can  be 
transferred  to  the  present  case  because  the  magnetic  field 
terms  in  the  operator  are  not  changed  by  the  derivation 
of  the  effective  potential,  as  can  be  seen  in  [7] . 

As  an  effective  Hamiltonian,  therefore,  for  the  case 
of  Faraday  geometry  (with  a symmetrical  gauge)  we  use 

H = -Ar  + ad* -j  + AV«t(r)  + Et 

+ J(x2  +y2)—j7tol.  ( 1 ) 

4 Mi 

with  the  anisotropy  constant  a - 1 — eioMi/eoiMt  • The 
z -component  of  the  angular  momentum  operator  in 
units  of  h is  denoted  as  l,.  We  have  used  as  energy  and 
length  units: 


R$ 


«o 


where  Ry  and  aB  are  the  atomic  Rydberg  and  the  Bohr 
radius.  Perpendicular  “1”  and  parallel  “H"  refer  to 
quantities  perpendicular  and  parallel  to  the  preferred 
axis,  which  we  define  to  be  the  z-axis.  The  field  strength 
is  given  in  terms  of  the  field  energy  over  the  effective 
Rydberg  energy  which  is  y * hwc/Rp  =*  uBX/Rynit  and 
HB  is  the  Bohr  magneton.  The  reduced  masses  are 
abbreviated  as  Mi  and  and  the  coefficient  in  the 
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Fig.  1.  Diamagnetic  shift  of  the  first  excited  state  as  a 
function  of  the  magnetic  field  strength  y for  Pbl2 . The 
full  line  is  the  present  result,  including  anisotropy  and 
exd ton— phonon  interaction;  the  dashed  line  represents 
the  hydrogenic  approximation. 

Zeeman  term  is  l/<4  * — l/mu.  The  additional 

electron-hole  potential  AP^r)  due  to  phonons  is 
given  [5]  as  (in  our  units): 

(l-£)  (1  £ 

(2) 

where  l/fi  * l/fll,  — l//Hh.  The  coefficients^/,  the 
exponents  bt  and  the  self  energy  E,  are  given  in  [S]  as 
functions  of  mh,  <«, «.  and  hc>>u).  In  the  present 
paper  we  can  use  the  same  analytical  form  of  the  poten- 
tial, replacing  those  material  parameters  by  the  follow- 
ing averages  [6]. 

The  quantities  fPI„  and  i.  are  defined  correspondingly. 

It  is  important  to  point  out,  that  we  use  averaged 
material  data  only  in  the  additional  potential  AKrff(r), 
whereas  in  the  other  terms  the  anisotropy  is  treated 
exactly. 

The  solution  of  (1)  can  be  expanded  in  terms  of 
spherical  harmonics 

*N,m(r)  - I (3) 

i 

where  m and  the  parity  are  good  quantum  numbers  as 
can  be  simply  seen  from  (1).  We  use  N as  a labelling 
index  of  the  states  and  all  / values  in  the  sum  are  either 
odd  or  even.  The  states  withm  =*  0 and  1,2, 3,4 
are  therefore  the  hydrogenic  Is,  2s,  3s,  3 d states  if  we 
approach  the  isotropic  limit  in  the  zero  field  case.  In  the 
resulting  system  of  radial  differential  equations  we  use 


MAGNETIC  FIELD  STRENGTH  (y) 

Fig.  2.  (a)  Diamagnetic  shift  of  the  second  and  third 
excited  states  for  PMa . For  comparison,  we  also  give  the 
linear  shift  of  the  band  edge  (lowest  Landau  level). 

(b)  Square  of  the  envelop  function  at  the  origin,  fs  = 

1 1 pN  o(0)!a  in  units  of  l]iml  for  the  second  (N=3)  and 
third  (N  = 4)  excited  state  as  functions  of  the  magnetic 
field  strength  y. 

for  the  coefficient  functions  the  following  expansion: 

FR.fr)  = Ci,Xm  e'd|r.  (4) 

< 

The  minimization  with  respect  to  the  linear  coeffedents 
C reduces  the  problem  to  the  diagonalize tion  of  the 
secular  matrix  Hy  — ESy.  With  7 terms  in  the  spherical 
harmonics  expansion  and  20  exponentials,  the  converg- 
ence of  the  resulting  eigenvalues  is  better  than  one  part 
in  10* . An  extensive  analysis  of  the  convergence  and  the 
choice  of  the  parameters  d,  will  be  given  elsewhere  [8] . 

Before  presenting  our  numerical  results  for  Pbl] , 
we  briefly  discuss  our  choice  of  the  material  parameters 
given  in  Table  1 . From  the  interpretation  of  the  “2 p ±t  ”, 
“3 p ±, ”,  and  the  “2p0"  states,  FrbhUch  et  aL  [9] 
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obtained  the  anisotropy  constant  a * 0.37,  the  band 
gap  Eg  = 2.527  eV  and  an  effective  Rydberg  Rp  * 

28  meV.  Using  the  value  [12]  e0i 2  3 4 5 6 7 26.4  and  [3] 

Hut  Hi  = 4.4,  we  find  e0|  = 9 5,  which  compares  reason- 
ably well  with  the  estimates  given  in  [10] , and  mj.  3 

0.512mo.  The  LO-phonon  frequency  lieu  = 13.4 meV 
and  e-i  » «_«  * 6.1  are  also  taken  from  [10] . The 
determination  of  the  average  masses  ifl,  and  tlih  require 
knowledge  of  four  individual  masses.  From  experiment 
we  know  only  Hi 3 0.51 6 m0  and  Hi  3 2-276m0.  Band 
structure  calculations  [11, 12]  suggest  a large,  almost 
isotropic  hole  mass,  and  they  show  that  the  parallel 
electron  mass  is  smaller  than  the  hole  mass.  Therefore, 
we  use  in  an  isotropic  hole  mass  mhx  = mhl  = 

mh  = 6m0.  This  gives  = 3.67 mo  and  mtL  - 0.56mo 
for  the  electron  masses.  A smaller  value  for  fh h,  e.g. 

2.5  m0  would  have  resulted  in  a value  mtll  = 25.7  m0 
which  is  unreasonable.  It  is  important  to  emphasize  that 
the  above  choice  for  fhh  is  not  critical  with  respect  to 
the  energy  shifts  in  the  magnetic  field.  In  fact,  as  will  be 
pointed  out,  othei  choices  for  ff ih  make  the  quenching 
due  to  the  exdton-phonon  interaction  even  more 
pronounced.  The  averaged  masses  used  in  AV^r)  and 
E,  are  fft,  = 0.7764 m0  and  ffl h = 6.0 m0. 

To  study  the  level  shifts  in  a magnetic  field,  we  have 
chosen  a range  of  0 < y < 0.10,  which  corresponds  to  a 
magnetic-field  range  of  0 < X < 250  kG.  (The  highest 
fields  ever  applied  to  Pblj  is  3C  - 180  kG.)  The  ground 
state  is  practically  unaffected  by  the  field.  The  same 
result  would  also  be  true  for  the  ground  state  in  a 
hydrogenic  description,  because  of  the  small  value  of  y. 

The  relatively  greater  sensitivity  of  excited  states  to 
the  magnetic  field  (even  for  small  y values)  exhibits  the 
crucial  role  played  by  the  exdton-phonon  interaction 
and  anisotropy.  Figure  1 shows  substantial  quenching 
of  the  diamagnetic  shift  Et(y)  — £j(0)  of  the  first 
exdted  state.  It  is  to  be  noted  that,  even  for  the  highest 
applied  field  of  180  kG,  i.e.  y = 0.072,  the  energy  shift 
is  only  0.3  meV,  which  is  below  the  resolution  of  the 


experiment  [4] . The  hydrogenic  description  would  have 
predicted  a shift  of  1.5  meV.  Figure  shows  that  the 
hydrogenic  model  is  completely  inadequate  for  all  values 
of  the  magnetic  field. 

The  results  for  the  second  and  third  exdted  states 
are  shown  in  Figs.  2(a)  and  2(b),  where  we  have  plotted 
E*(y)  and  £4(7)-  The  conduction  band  edge  in  the  zero 
field  case  is  used  as  the  zero  of  energy.  Its  linear  shift 
(lowest  Landau  level)  in  the  field  is  also  plotted.  The 
lower  state  E3(y)  is  predominantly  i-like  at  zero  field 
strength.  For  this  state,  also,  the  diamagnetic  shift  is 
below  the  experimental  resolution.  The  hydrogenic 
model  predicts  a shift  of  4 meV  (for  H=  180  kG)  which 
again  reveals  its  inadequacy.  The  higher  state  £4(7) 
shows  a larger  diamagnetic  shift,  which  in  prindple 
could  be  observed.  This  state,  however,  is  the  “3d” 
state,  which  has  only  12%  s-character  (due  to  the  aniso- 
tropy) in  the  absence  of  a magnetic  field.  With  increasing 
magnetic  field  the  intensity  of  this  state  increases  as 
shown  in  Fig.  2(b).  Since,  however,  the  energy  position 
of  this  state  is  very  near  the  quasicontinuum  of  the 
spectrum,  it  is  probably  very  difficult  to  resolve.  The 
peculiar  behaviour  of  the  oscillator  strength  for  this 
state  near  7 = 0.08  is  direct  evidence  of  the  complex 
interaction  of  this  level  with  higher  levels.  In  fact,  for 
7 near  0.08  the  “3d”  and  “4*”  states  would  become 
degenerate  if  their  crossing  were  not  forbidden  by 
symmetry. 

In  summary,  we  have  presented  here  a theory,  which 
properly  takes  into  account  the  influences  of  the  aniso- 
tropy and  the  exciton-phonon  interaction  on  exdton 
levels  in  a magnetic  field.  Ground  and  excited  states 
have  been  calculated  and  for  all  levels,  a very  large 
quenching  of  the  diamagnetic  shift  is  found.  The 
magnetooptical  data  [4]  for  Pbla  are  in  agreement  with 
the  theoretical  predictions.  The  above  effects  play  an 
important  role  in  a large  clan  of  materials  and  the 
present  theory  should  prove  useful  in  their  description. 
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